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I (57) Abstract 

A synthetic strategy for the creation of 
large scale chemical diversity. Solid-phase 
chemistry, photolabile protecting groups, and 
photolithography are used to achieve light-di- 
rected spatlally-addressable parallel chemical 
^thesis. Binary masking techm'ques are uti- 
lized in one embodiment A reactor system, 
photorcmovablc protecting gnmps, and im- 
proved data collection and handling tech- 
niques are also disdosed. A technique for 
I screening linker molecules is also provided. 
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VERY lARGE SCAU! IimQBILT ZED POLYMER SYNTHESIS 

This application is related to the following 
United States applications: U.S. Serial No. 492,462, 
filed March 1, 1990; U*S. Serial No. 362,901, filed Jvme 
7, 1989; U.S. Serial No* 624,120, filed Deceinber 6, 1990; 
U.S. Serial No. 626,730, filed December 6, 1990; and U.S. 
Serial No. 624,114, filed December 6, 1990. Bach of 
these applications is incorporated herein by reference 
for all purposes. This application is also related to 
per application WO 90/15070 which was piiblished December 
13, 1990 2md is also incorporated by reference herein for 
all purposes. 

BACKGROUND OF THE INVENTION 
The present invention relates to the field 
of polymer synthesis. More specifically, the invention 
provides a reactor system, a masking strategy, 
photoremovable protecting groups, data collection and 
processing techniques, and applications for light 
directed synthesis of diverae polymer sequences on 
substrates. 
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SUMMMtY OF THE INVENTION 
Methods, apparatus, and compositions for 
synthesis and use of techniques for diverse polymer 
sequences on a substrate are disclosed, as well as 
applications thereof. 

According to one aspect of the invention, an 
improved reactor system for synthesis of diverse polymer 
sequences on a substrate is provided. According to this 
embodiment the invention provides for a reactor for 
contacting reaction fluids to a siabstrate; a system for 
delivering selected reaction fluids to the reactor; a 
translation stage for moving a mask or substrate from at 
least a first relative location relative to a second 
relative location; a light for illuminating the substrate 
through a mask at selected times; and an appropriately 
programmed digital computer for selectively directing a 
flow of fluids from the reactor system, selectively 
activating the translation stage, and selectively 
illuminating the substrate so as to form a pliirality of 
diverse: polymer sequences on the substrate at 
predetermined locations. 

The invention also provides a technique for 
selection of linker molecules in VLSIPS. According to 
this aspect of the invention, the invention provides a 
method of screening a plurality of linker polymers for 
use in binding affinity studies. The invention includes 
the steps of forming a plurality of liiiker polymers on a 
substrate in selected regions, the linker polymers formed 
by the steps of recursively: on. a. surface of a 
substrate, irradiating a portion of the selected regions 
to remove a protecting group, and contacting the stirface 
with a monomer; contacting the plurality of linker 
polymers with a ligand; and contacting the ligand with a 
labeled receptor. 
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According to another aspect of the invention, 
improved photoremovable protecting groups are provided. 
According to this aspect of the invention a compound 
having the formula: 




OMe 

wherein n = 0 or 1; Y is selected from the group 
consisting of an oxygen of the carboxyl group of a 
natural or unnatural amino acid, an amino group of a 
natural or unnatural amino acid, or the C-5' oxygen group 
of a natural or tinnatural deoxyribonucleic or ribonucleic 
acid; and r' independently are a hydrogen atom, a lower 
allcyl, aryl, benzyl, halogen, hydroxy 1, alkoxyl, thiol, 
thioether, amino, nitro, carboxyl, formate, formamido, 
sulfido, or phosphide group; and r' is a allcoxy, allqrl> 
aryl, hydrogen, or alkenyl group is provided. 

The invention also provides improved masking 
techniques for VLSIPS. According to one aspect of the 
masking technique, the invention provides an ordered 
method for forming a plurality of polymer sequences by 
sequential addition of reagents comprising the step of 
serially protecting and deprotecting portions of the 
plurality of polymer sequences for addition of other 
portions of the polymer sequences using a binary 
synthesis^ strategy. 

Improved data collection equipment and 
techniques are also provided. According to one 
embodiment, the instrumentation provides a system for 
determining affinity of a receptor to a llgand 
coB^rising: means for applying light to a surface of a 
substrate, the substrate comprising a plurality of 
ligands at predetermined locations, the means for 
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applying directing light providing simultaneous 
illumination at a plurality of the predetermined 
locations; and an array of detectors for detecting 
fluorescence at the plurality of predetermined locations. 
The invention further provides for improved data analysis 
techniques including the steps of exposing fluorescently 
IcU^elled receptors to a substrate, the substrate 
comprising a plurality of ligands in regions at known 
locations; at a plurality of data collection points 
within each of the regions, determining an amount of 
fluorescence from the data collection points; removing 
the data collection points deviating from a predetermined 
statistical distribution; and determining a relative 
binding affinity of the receptor from remaining data 
collection points. 

Protected amino acid N-carboxy anhydrides for 
use in polymer synthesis are also disclosed. According 
to this aspect of the invention, a compound having the 
following formula is provided: 




where R is a side chain of a natural or unnatural amino 
acid and X is a photoremovable protecting group. 

A further understanding of the nature and 
advantages of the inventions herein may be realized by 
reference to the remaining portions of the specification 
and the attached drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 schematically illustrates light-directed 

spatially-addressable parallel chemical synthesis; 

Fig. 2 schematically illustrates one example of 

light-directed peptide synthesis; 

Fig. 3 schematically illustrates the software 

for the automated system for synthesizing diverse polymer 

sequences; 

Fig. 4a and 4b illustrate operation of a 
program for polymer synthesis; 

Fig. 5 is a schematic illustration of a "pure'* 
binary masking strategy; 

Fig. 6 is a schematic illustration of a gray 
code binary masking strategy; 

Fig. 7 is a schematic illustration of a 
modified gray code binary masking strategy; 

Fig. 8a schematically illustrates a masking 
strategy for a fo\ir step synthesis; 

Fig. 8b schematically illustrates synthesis of 
all 400 peptide dimers; 

Fig. 9 is a coordinate map for the ten-step 
binary synthesis; 

Fig. 10 schematically illustrates a data 
collection system; 

Fig. 11 is a block diagram illustrating the 
architecture of the data collection system; 

Fig. 12 is a flov chart illustrating operation 
of software for the data collection/ analysis system; and 

Fig. 13 schematically illustrates one example 
of light-directed oligonucleotide synthesis. 
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DESCRIPTXOH OF THE SREFEBSED EMBODIMENTS 
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I. n«f initlons 

certain terms used herein are intended to have 
the following general definitions: 

1. rrMn piewentarv ; Refers to the topological 
compatibility or matching together of interacting 
surfaces of a ligand molecule and its receptor. 
Thus, the receptor and its ligand can be described 
as complementary, and furthermore, the contact 
surface characteristics are coa«>lementary to each 
other. 

2. Bpltope ; The portion of an antigen molecule which 
is delineated by the area of interaction with the 
subclass of receptors known as antibodies. 

3. T.iaand t A ligand is a molecule that is recognized 
by a particular receptor. Examples of llgands that 
can be investigated by this invention include, but 
are not restricted to, agonists and antagonists for 
cell membrane receptors, toxins and venoms, viral 
epitopes, hormones (e.g., opiates, steroids, etc.), 
hormone receptors, peptides, enzymes, enzyme 
substrates, cof actors, drugs, lectins, sugars, 
oligonucleotides, nucleic acids, oligosaccharides, 
proteins, and monoclonal antibodies. 

4. Monoaier t A member of the set of small molecules 

which can be joined together to form a polymer. The 
set of monomers includes but is not restricted to, 
for example, the set of common L-amino acids, the 
set of D-amino acids, the set of synthetic amino 
acids, the set of nucleotides and the set of 
pentoses and hexoses. As used herein, monomer 
refers to any m^nber of a basis set for synthesis of 
a polymer. Por example, dlmers of the 20 naturally 
occurring L-amino acids form a basis set of 400 
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monomers for synthesis of polypeptides. Different 
basis sets of monomers may be used at successive 
steps in the synthesis of a polymer. Furthermore^ 
each of the sets may include protected members, which 
are. modified after synthesis. 

5. yept^dte; A polymer in which the monomers are alpha 
amino acids and which are joined together through 
amide bonds and is alternatively referred to as a 
polypeptide. In the context of this specification 
it should be appreciated that the amino acids may be 
the L-optical isomer or the D-optical isomer. 
Peptides are often two or more amino acid monomers 
long, and often more than 20 amino acid monomers 
long. Standard abbreviations for amino acids are 
used Ce^g., P for proline). These abbreviations are 
included in Stryer^ Biochemistry , Third Ed., 1S88, 
which is incorporated herein by reference for all 
piirposes. 

6. SaMaiLiaii: Energy which may be selectively applied 
including energy having a wavelength of between 10'" 
and 10* meters including^ for example^ electron beam 
radiation, gamma radiation, x-ray radiation^ ultra- 
violet radiation, visible light, infrared radiation, 
microwave radiation, and radio waves. "Irradiation" 
refers to the application of radiation to a surface. 

7. I^eceptor; A molecule that has an affinity for a 
given Uganda Receptors may be naturally-occurring 
or synthetic molecules. Also, they can be employed 
in their unaltered state or as aggregates with other 
species. Receptors may be attached, covalently or 
noncovalently, to a binding member, either directly 
or via a specific binding substance. Examples of 
recept6rs which can be employed by this invention 
include, but are not restricted to, antibodies. 
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cell membrane receptors, monoclonal antibodies 
and antisera reactive with specific antigenic 
determinants (such as on viruses, cells, or other 
materials) , drugs, polynucleotides, nucleic acids, 
peptides, cof actors, lectins, sugars, 
polysaccharides, cells, cellular membranes, and 
organelles. Receptors are sometimes referred to in 
the art as anti-ligands. As the term receptors is 
used herein, no difference in meaning is intended. 
A "Ligand Receptor Pair" is formed when two 
macromolecules have combined through molecular 
recognition to form a complex. 

other examples of receptors which can be 
investigated by this invention include but are not 
restricted to: 

a) Tf4m.r>araaniinfl i^^eepters; Determination of 
ligands which bind to receptors, such as 
specific transport proteins or enzymes 
essential to survival of microorganisms, 

is useful for a new class of antibiotics. Of 
particular value would be emtibiotics against 
opportunistic fungi, protozoa, and those 
bacteria resistant to the antibiotics 
in cujnrent use. 

b) Enzymes ; Por instance, determining the binding 
site of enzymes such as the enzymes responsible 
for cleaving neurotransmitters provides useful 
information. Determination of ligands which 
bind to certain receptors to modulate the 
action of the enzymes which cleave the 
different neurotransmitters is \iseful in the 
development of drugs which can be used in the 
treatment of disorders of neurotransmission. 

c) antibodies t For instance, the invention may 
be useful in investigating the ligand-binding 
site on the antibody molecule which combines 
with the epitope of an antigen of interest; 
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determining a sequence that mimics an antigenic 
epitope may lead to the development of vaccines 
of which the immunogen is based on one or more 
of such sequences or lead to the development of 
related diagnostic agents or compounds useful 
in therapeutic treatments such as for auto- 
immune diseases (e.g., by blocking the binding 
of the ^»self" antibodies). 
^) Nucleic Acids r Sequences of nucleic acids may 
be synthesized to establish DNA or RNA binding 
sequences. 

^> Catalytic Polypeptides; Polymers, preferably 
polypeptides, which are capable of promoting a 
ichemical reaction involving the conversion of 
one or more reactants to one or more products « 
such polypeptides generally include a binding 
site specific for at least one reactant or 
reaction intermediate and an active 
functionality proximate to the binding site, in 
which the functionality is capable, of 
chemically modifying the bound reactant. 
catalytic polypeptides are described in, for 
example, V.S. application Serial No. 404,920, 
which is incorporated herein by reference for 
all purposes. 

f) Hormone receptors* For instance, the receptors 
for insulin and growth hormone. Determination 
of the ligands whidh bind with high affinity to 
a receptor is useful in the development of, 
for example, an oral replacement of the daily 
injections which diabetics must take to relieve 
the symptoms of diabetes, and in the other 
. case, a replacement for the scarce human 
growth hormone which can only be obtained from 
cadavers or by recombinant DNA technology. 
Otber exanples are the vasoconstrictive hormone 
receptors; determination of those ligands which 
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bind to a receptor may lead to the development 
of drugs to control blood pressure* 
g) Opiate receptors ; Determination of llgands 

which bind to the opiate receptors in the brain 
is useful in the development of less^addictive 
replacements for morphine and related drugs. 

8. substrate ; A material having a rigid or semi-rigid 
surface. In many embodiments, at least one surface 
of the substrate will be substantially flat, 
although in some embodiments it may be desirable to 
physically separate synthesis regions for different . 
polymers with, for example, veils, raised regions, 
etched trenches, or the like. According to other 
embodiments, small beads may be provldeid on the 
s\irf ace which may be released upon completion of the 
synthesis. 

9. y^otectino group : A material which is chemically 
bound to a monomer unit and which may be removed 
upon selective exposure to an activator such as 
electromagnetic radiation. Examples of protecting 
groups with utility herein include those coiq>rising 
nltroplperonyl, pyrenylmethoxy-carbonyl, nitrovera** 
tryl, nitrobenzyl, dimethyl dlmethoxybenzyl, 
5-bromo-7-nitrolndollnyl , o--hydroxy-a-methyl 
cinnamoyl, and 2*-oxymethylene anthraquinone. 

10. Predefined Reaion t A predefined region is a 
localized area on a surface which ie, was, or is 
intended to be activated for formation of a polymer. 
The predefined region may have any convenient shape, 
e.g., circular, rectangular, elliptical, wedge- 
shaped, etc. For the ssOce of brevity herein, 
"predefined regions** are sometimes referred to 
simply as "regions." 
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11. - Stibstaniiiallv Pure ; h polymer is considered to be 

n substantially ptire? within a predefined region of 
a substrate when it exhibits characteristics that 
distinguish it from other predefined regions. 
Typicsaiy, purity will be measured in terms of 
biological activity or function as a result of 
uniform sequence. Such characteristics, will 
typically be measured by way of binding with a 
selected ligand or receptor. 

12. Activator refers to an energy source adapted to 
render a group active and which is directed from a 
source to a predefined location on a siibstrate. A 
primary Illustration of an activator is light, 
other examples of activators include ion beams, 
electric fields, magnetic fields, electron beams, 
ray, and the lilce. 

13. Binary Synthesis Strategy refers to an ordered 
strategy for peorallel synthesis of diverse polymer 
sequences by sequential additiph of .reagents which 
may be represented by a reactant matrix, and a 
switch matrix, ^e product of which is a p]x>duct 
matrix. A reactant matrix is a 1 x m matrix of the 
building bloclcs to be added. The switch matrix is 
all. or a subset of the binary numbers, preferably 
ordered, between 1 and m arranged in columns. In 
preferred embodiments, a binary strategy is one in 
which at least two successive steps illuminate half 
of a region of interest on the substrate. In most 
preferred embodiments, binary synthesis refers to a 
synthesis strategy which also factors a previous 
addition step. For example, a strategy in which a « 
switch matrix for a masking strategy halves regions 

that were previously illuminated, illuminating about ^ 
half of the previously illuminated region and 
protecting the remaining half (while also protecting 



wo 92/10092 



PCr/US91/08693 
13 

about half of previously protected regions and 
illuninating about half of previously protected 
regions) . It will be recognized that binary rounds 
may be interspersed with non-binary rounds and that 
only a portion of a substrate may be subjected to a 
binary schene, but will still be considered to be a 
binary masking strategy within the definition 
herein. A binary "masking" strategy is a binary 
synthesis which uses light to remove protecting 
groups from materials for addition of other 
materials such as amino acids. In preferred 
embodiments, selected colunns of the switch matrix 
are arranged in order of increasing binary numbers 
in the columns of the switch matrix. 

14. T.tnker refers to a molecule or group of molecules 
attached to a substrate and spacing a synthesized 
polymer from the substrate for exposure/binding to a 
receptor. 

II. SsQSial 

The present invention provides synthetic 

strategies and devices for the creation of large scale 

chemical diversity. Solid-phase chemistry, photolabile 

protecting groups, and photolithography are brought 

together to achieve light-directed spatially-addressable 

parallel chemical synthesis in preferred embodiments. 

The invention is described herein for purposes 

of illustration primarily with regard to the preparation 

of peptides and nucleotides, but could readily be applied 

in the preparation of other polymers. Such polymers 

include, for example, both linear and cyclic polymers 

of nucleic acids, polysaccharides, phospholipids, and 

peptides having either a-, p-, or w-amino acids, hetero- 

polymers in which a known drug is covalently bound to any 

of the above, polyurethanes, polyesters, polycarbonates, 

polyureas, polyamides, polyethylenelmines, polyarylene 
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sulfides, polysiloxanes, polyimides, polyacetates, or 
other polymers which vill be apparent upon review of this 
disclosture. It will be recognized further^ that 
illustrations herein are primarily with reference to 
C- to N-terminal synthesis, but the invention could, 
readily be applied to N- to C-terminal synthesis without 
departing from the scope of the invention. 

A. Deprotection and Addition 

The present invention uses a masked light 
source or other activator to direct the simiiltaneous 
synthesis of many different chemical compounds. Fig. i 
is a flow cdiart illustrating the process of forming 
chemical confounds according to one embodiment of the 
invention. Synthesis occux^ on a solid support 2. A 
pattern of i llumi nation through a mask 4a using a light 
source 6 determines which regions of the support are 
activated for chemical coupling. In one preferred 
embodiment activation is accomplished by using light to 
remove photolabile protecting groups from selected areas 
of the substrate. 

After deprotection, a first of a set of 
building blocks (indicated by "A" in Fig. l) , each 
bearing a photolabile protecting group (indicated by »»X") 
is exposed to the surface of the substrate and it reacts 
with regions that were addressed by light in the 
preceding step. The substrate is then illuminated 
through a second mask 4b, ^rtiich activates another region 
for reaction with a second protected building block "B». 
The pattern of masks used in these illuminations and the 
sequence of reactants define the ultimate products and 
their locations, resulting in diverse sequences at 
predefined locations, as shown with the sequences ACEG 
and BDFH in the lower portion of Fig. !• Preferred 
embodiments of the invention take advantage of 
coinbinatorial masking strategies to form a large number 
of compounds in a small number of chemical steps. 
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A high degree a£ miniaturization is possible 
because the density of compounds is determined largely 
with regard to spatial addressability of the activator, 
in one case the diffraction of light. Each compound is 
physically accessible and its position is precisely 
known. Hence, the array is spatially-addressable and its 
interactions with other molecules can be assessed. 

In a particular embodiment shown in Pig. 1, the 
substrate contains amino groups that are blocked with a 
photolabile protecting group. Amino acid sequences are 
made accessible for coupling to a receptor by removal of 
the photoprotecting groups. 

When a polymer sequence to be synthesized is, 
for example, a polypeptide, amino groups at the ends of 
linkers attached to a glass substrate are derivatized 
with nitroveratryloxycarbonyl (NVOC) , a photoremovable 
protecting group. The linker molecules may be, for 
example, aryl acetylene, ethylene glycol oligomers 
containing from 2-10 monomers, diamines, diacids, amino 
acids, or conbinations thereof. Photodeprotection is 
effected by Illumination of the substrate through, for 
example, a mask wherein the pattern has transparent 
regions with dimensions of, for example, less than 1 cm^, 
10'^ cm^ 10'^ cm^, 10*^ cm^, 10"* cm^, 10"* cm^, 10"^ cm^, 
10"^ cm^, 10*^® cm^, or 10*^^ cm^. In a preferred embodiment, 
the regions are between about 10x10 and 500x500 im. 
According to some embodiments, the masks are arranged to 
produce a checkerboard array of polymers < although any 
one of a variety of geometric configurations may be 
utilized. 

1* Example 

In one example of the invention, free amino 
groups were f luorescently labelled by treatment of the 
entire substrate surface with fluorescein isothiocynate 
(FITC) after photodeprotection. Glass microscope 
slides were cleaned, aminated by treatment with 
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0.1% asilnopTOpYlt:riethoxysilahe in 95% ethseinol, and 
incubated at: 110 ""C for 20 min. The aminated surface 
of the slide Has then exposed to a 30 ^ solution of 
the H-hydroxysuccinimide ester of NVOC«-GAaA 
(nitroveratryloxycarbonyl--r-aiaino butyric acid) in DMFo 
The woe protecting group was photolytically removed by 
imaging the 365 nm output from a Eg arc lamp through a 
chrome dn glass 100 pm checkerboard mask onto the 
substrate for 20 min at a power density of 12 mWca?. The 
exposed surface Has then treated Hith 1 iM 7XTG in DHF. 
The substrate surface Has scanned in an epi^-f luorescence 
microscope (Zeiss Axioskop 20) using 488 nm excitation 
from an argon ion Is^er (Spectra »Physics model 2025} • 
The fluorescence ^aission above 520 nm was* detected by a 
cooled photomultiplier (Hamamatsu 943-02) operated in a 
photon counting mode. Fluorescence intensity Has 
translated into a color display nith red in the . highest 
intensity and black in the lonest intensity areas o The 
presence of a high-contrast fluorescent checkerbos^d 
pattern of 1003^100 $m elements revealed that free amino 
groups Here generated in specific regions by spatially- 
localised photodeprotectiono 

2o B^gample 

Fig» 2 is a floH chart illiastrating another 
example of the invention o Carboxy-activated ^OC-leucine 
Has alloHed to react nith an aminated substrate. The 
carbosy activated HOBT ester of. leucine and other amino 
acids used in this synthesis Has formed by mixing 
0.25 mmol of the NVOC amino protected amino acid Hith 
37 mg HOBT- (l-hydroxybenzotriazole) , 111 mg BOP 
(ben20tria2olyl-n°oxy-tris (dimethylamino) - 
phosphoniumhexa-fluorophosphate) and B6 (a1 DJEh 
(diisopropylethylamine) in 2.5 ml DB^. The WOC 
protecting group Has removed by uniform illumination. 
Carboxy-activated MVOC-phenylalanine Has coupled to the 
exposed raino groups for 2 hours at room temperature. 
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and then washed with DMF and methylene chloride. Two 
unmasked cycles of photodeprotection and coupling with 
carhoxy-activated NVOC-glycine were carried out. The 
surface was then illuminated through a chrome on glass 
50 ^m checkerboard pattern mask. Carboxy-activated 
Na-tBOC-O-tButyl-L-tyrosine was then added. The entire 
surface was uniformly illuminated to photolyze the 
remaining NVOC groups. Finally, carboxy-activated 
NVOC-Ii-proline was added, the NVOC group was removed by 
illumination, and the t-BOC and t-butyl protecting groups 
were removed with TPA. After removal of the protecting 
groups, the surface consisted of a 50 checkerboard 
array of Tyr-Gly-Gly-Phe-Leu (YGGFL) and Pro-Gly-Gly-Phe- 
Leu (PGGFL). See also SEQ ID H0;1 and SEQ ID NO: 2. 

B. Antibody Recognition 

In one preferred embodiment the s\ibstrate is 
used to determine which of a plurality of amino acid 
seguences is recognized by an antibody of interest. 

1. Example 

In one example, the array of pentapeptides in 
the example illustrated in Fig. 2 was probed with a moxise 
monoclonal antibody directed against /5-endorphin. This 
antibody (called 3E7) is known to bind YGGFL and YGGFM 
(see also SEQ ID N0:1 and SEQ ID N0:21) with nanomolar 
affinity and is discussed in Meo et al . , Proc. Natl. 
Aead. gei, USA (1983) 80:4084, Which is incorporated by 
reference herein for all purposes. This antibody 
requires the amino terminal tyrosine for high affinity 
binding. The array of peptides formed as described in 
Fig. 2 was incubated with a 2 Mg/iftl mouse monoclonal 
antibody (3E7) known to recognize YGGFL. See also SEQ ID 
N0:1. 3E7 does not bind PGGFL. See also SEQ ID NO: 2. A 
second incubation with f luoresceinated goat anti-mouse 
antibody labeled the regions that bound 3E7. The surface 
was scanned with an epi-fluorescence microscope. The 
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results showed alternating bright and dark 50 im squares 
indicating that YGGFL (SEQ ID N0:1) and PGGFL (SEQ ID 
NO: 2) were synthesized in a geometric array determined by 
the mask. A high contrast (>12:1 intensity ratio) 
fluorescence checkerboard image shows that (a) YGGFL (SEQ 
ID NO:l) and PGGFL (SEQ ID NO: 2) were synthesized in 
alternate 50 m squares^ (b) YGGFL (SEQ ID NO :1) attached 
to the surface is accessible for binding to antibody 3E7, 
and (c) antibody 3E7 does not bind to PGGFL (SEQ ID NO: 2) 

A three-dimensional representation of the 
fluorescence intensity data in a 2 square by 4 square 
rectangular portion of the checkerboard was produced. It 
shows that the border between synthesis sites is sharp. 
The height of each spike in this display is linearly 
proportional to the integrated fluorescence intensity in 
a 2.5 im pixel <r The transition between PGGFL and YGGFL 
occurs within two spikes (5 Mm) . There is little 
variation in the fluorescence intensity of different 
YGGFL squares. The mean intensity of sixteen YGGFL 
synthesis sites was 2.03x10^ counts and the standard 
deviation was 9.6x10^ counts. 

III. SYiHi^^irg 

A. Reactor Svstera 

Fig. 3 schematically illustrates a device used 
to synthesize diverse polymer sequences on a substrate. 
The device includes an automated peptide synthesizer 401. 
The automated peptide synthesizer is a device irtiich flows 
selected reagents through, a flow cell 402 under the 
direction of a computer 404» In a preferred embodiment 
the synthesizer is an ABI Peptide Synthesizer, model 
no. 431A. The computer may be selected from a wide 
variety of computers or discrete logic including for, 
example, an IBM PC-AT or similar computer linked with 
appropriate internal control systems in the peptide 
synthesizer. Uie PC is provided with signals from the 
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board computer indicative of ^ for example # the end of a 
coupling cycle. 

Substrate 406 is mounted on the flow cell, 
forming a cavity between the substrate and the flow cell. 
Selected reagents flow through this cavity from the 
peptide synthesizer at selected times, forming an array 
of peptides on the face of the substrate in the cavity « 
Mounted above the substrate, and preferably in contact 
with the substrate is a mask 408. Mask 408 is 
transparent in selected regions to a selected wavelength 
of light and is opaque in other regions to the selected 
wavelength of light. The mask is illuminated with a 
light source 410 such as a UV light source. In one 
specific embodiment the light source 410 is a model no. 
82420 made by Oriel. The mask is held and translated by 
an x-y*z translation stage 412 such as an x*y translation 
stage made by Newport Corp. The computer coordinates 
action of the peptide synthesizer, x-y translation stage, 
and light source. Of course, the Invention may be used 
in some embodiments with translation of the substrate 
Instead of the mask. 

In operation, the substrate is mounted on the 
flow cell. The substrate, with its surface protected by 
a suitable photo removable protecting group, is exposed 
to light at selected locations by positioning the mask 
and directing light from a light source, through the 
mask, onto the substrate for a desired period of time 
(such as, for exasqple, 1 sec to 60 min in the case of 
peptide synthesis). A selected peptide or other 
monomer/polymer is pumped through the reactor cavity by 
the peptide synthesizer for binding at the selected 
locations on the substrate. After a selected reaction 
time (such as about 1 sec to 300 min in the case of 
peptide reactions) the monomer is washed from the system, 
the mask is appropriately repositioned or replaced, and 
the cycle is repeated. In most embodiments of the 
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invention, reactions nay be conducted at or near ambient 
tenperature. . - 

Pigs 4a and 4b are flo^ charts of the software 
used in operation of the reactor system « At step 502 the 
peptide synthesis softafare is initialized. At step 504 
the system calibrates positioners, on the £»y trainslation 
stage and begins a main loop. . At step 506 the system 
determines whichf if any, of the function key& on the 
computer have been pressed. If Fl has been pressed, the 
system prompts the user for input of a desired synthesis 
process. If the user enters F2, the system allous a user 
to edit a file for a synthesis process at step 510 o If 
the user enters F3 the system loads a process from a disk 
at step 512. If the user enters F4 the system saves an 
entered or edited process to disk at step '514 « If the 
user selects F5 ^e current process is displayed at step 
516 while selection of F6 starts the main portion of the 
program, i.e., the actual synthesis according to the 
selected process o If the user selects F7 the system 
displays the location of the synthesized peptides, while 
pressing FIO returns the user to the disk operating 
system^i^ 

Fig. 4b illustrates the synthesis step 518 in 
greater detail . The main loop of the program is started 
in which the system first moves the mask to a nesct 
position at step 526. During the main loop of the 
program, necessary chemicals flow through the reaction 
cell under the direction of the on^board cosaputer in the 
peptide synthesiser. At step 528 the system then waits 
for an exposure command and, upon receipt of the exposure 
command exposes the substrate for a desired time at step 
530. ^en an acknowledgement of complete exposure is 
received at step 532 the system determines if the process 
is complete at step 534 and, if so, waits for additional 
keyboard input at step 536 and, thereafter, exits the 
perform synthesis process. 
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A computer program used for operation of 
the system described above is written in Turbo C. (Borland 
Int'l) and has been implemented in an 
IBM compatible system. The motor control software is 
adapted from software produced by Newport Corporation. 
It will be recognized that a large variety of programming 
languages could be utilized without departing from the 
scope of the Invention herein. Certain calls are made to 
a graphics program in "Programmer Guide to PC and PS2 
video systems" (Wilton, Microsoft Press, 1987), which is 
incorporated herein by reference for all purposes. 

Alignment of the mask is achieved by one of two 
methods in preferred embodiments. In a first embodiiment 
the system relies upon relative alignment of the various 
components, ^ch is normally acceptable since x-y-z 
translation stages are capable of sufficient accuracy for 
the purposes herein. In alternative embodiments, 
alignment marks on the substrate are coupled to a CCD 
device for appropriate alignment. 

According to some embodiments, pure reagents 
are not added at each step, or cc»q>lete photolysis of 
the protecting groups is not provided at each step. 
According to these embodiments, multiple products will 
be formed in each synthesis site. For example, if the 
monomers A and B are mixed during a synthesis step, A and 
B will bind to deprotected regions, roughly in proportion 
to their concentration in solution. Hence, a mixture of 
compounds will be formed in a synthesis region. A 
substrate formed with mixtures of compounds in various 
synthesis regions may be used to perform, for example, an 
initial screening of a lairge number of compounds, after 
which a smaller number of compovuids in regions which 
exhibit high binding affinity are further screened. 
Similar results may be obtained by only partially 
photolyzing a region, adding a first monomer, 
re-photolyzing the same region, and exposing the 
region to a second monomer. 
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B. fiinarv Svn i^hegis Strategy 

In a light-directed chemical synthesis, the 
products formed depend on the pattern cind order of masks, 
and on the order of reactants. To make a set of products 
there will in general be a finite nuniber of possible 
masking strategies. In preferred embodiments of the 
invention herein a binary synthesis strategy is utilized. 
The binary synthesis strategy is illustrated herein 
primarily with regard to a masking strategy, although it 
will be applicable to other polymer synthesis strategies 
such as the pin strategy, and the like. 

In a binary synthesis strategy, the substrate 
is irradiated with a first mask, exposed to a first 
building block, irradiated with a second mask, reposed to 
a second building block, etc. Each combination of masked 
irradiation and exposure to a building block is referred 
to herein as a "cycle.** 

In a preferred binary masking strategy, the 
masks for each cycle allow illumination of half of a 
region of interest on the substrate, and no illumination 
of the remaining half of the region of interest. By 
"half" it is intended herein not to mean exactly one-half 
the region of interest, but instead a large fraction of 
the region of interest such as from cibout 3 a to 70 
percent of the region of interest. It will be understood 
that the entire masking strategy need not take a binary 
form? instead non-binary cycles may be introduced as 
desired between binary cycles. 

In preferred embodiments of the binary masking 
Strategy, a given cycle illuminates only about half of 
- the region which was illuminated in a previous cycle, 
while not illuminating the remaining half of the 
illuminated portion from the previous cycle. Conversely, 
in such preferred embodiments, a given cycle illuminates 
half of the region which was not illuminated in the 
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previous cycle and does not illuminate half the region 
which was not illuminated in a previous cycle. 

in the synthesis strategy disclosed herein, the 
longest length (i) of the synthesized polymers is 
t = n/a; where n is the number of cycles and a is the 
number of chemical building blocks (note that a given 
building block may be repeated) . 

The synthesis strategy is most readily 
illustrated and handled in matrix notation. At each 
synthesis site, the determination of whether to add a 
given monomer is a binary process. Therefore, each 
product element P, is given by the dot product of two 
vectors, a chemical reactant vector, e.g., C = (&,B,C,D1, 
and a binary vector a,, inspection of the products in the 
example below for a four-step synthesis, shows that in 
one four-step synthesis ff^ = ti»o,i,o], = [1,0,0,1], 
a, = [0,1,1,01, and = [0,1,0,1], where a 1 indicates 
illumination and a 0 indicates no illumination. 
Therefore, it becomes possible to build a -switch matrix" 
s from the column vectors a, (j » l,k where k is the 
number of products) . 

ffi ft a, "4 
S ■ 1 1 0 0 
0 0 11 
10 10 
0 10 1 



The outcome P of a synthesis is simply P =» CS, the 
product of the chemical reactant matrix and the switch 
matrix. 

The switch matrix for an n-cyole synthesis 
yielding k products has n rows and k columns. An 
important attribute of S is that each row specifies a 
mask. A two-dimensional mask m, for the jth chemical step 
of a synthesis is obtained directly from the jth row of S 
by placing the elements s,i,...s,„ into, for example, a 
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square format. The particular arrangement b^low 
provides a square format., although linear or other 
arrangements may be utilized. 



= s„ 


S,2 




Sl4 


mj = 








s» 


Sj, 












Sn 














Sa 




S44 









Of cotirse, compounds formed in a light- 
activated synthesis can be positioned in any defined 
geometric array • A square or rectangular matrix is 
convenient but not required. The rows of the svitch • 
matrix may be transformed into any convenient array as 
long as equivalent transformations are used for each row. 

For example, the masks in the four-step 
synthesis below are then denoted by: 

mi = ll m2-00 %=10 ii^«Ol 

00 11 10 01 - 

where l denotes illumination (activation) and 0 denotes 
no illumination • 

The matrix representation is used to generate a 
desired set of products and product maps in preferred 
embodiments. Each compound is defined by the product of 
the chemical vector and a particular switch vector. 
Therefore f for each synthesis address, one simply saves 
the switch vector, assembles all of them into a switch 
matrix, and extracts each of the rows to form the masks. 

In some cases, particular product distributions 
or a maximal number of products are desired. For 
example, for C ^ [A,B,C^D], any switch vector (Oj) 
consists of four bits. Sixteen four-bit vectors exist. 
Hence, a maximum of 16 different products can be made by ' 
sequential addition of the reagents [A#B,C,D]. These 16 
column vectors can be assembled in 16! different ways to 
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foriD a switch nalrrix. The order of the coliunn vectors 
defines the masking patterns^ and therefore, the spatial 
ordering of products but not their makeup. One ordering 
of these columns gives the following switch matrix (in 
which "null" (0) additions are included in brackets for 
the sake of completeness, although such null additions 
are elsewhere ignored herein) : 



1 


1 


1 


1 


1 


1 


1 


1 


0 


0 


0 


0 


0 


0 


0 


0 


A 


[0 


0 


0 


0 


0 


0 


0 


0 


1 


1 


1 


1 


1 


1 


1 


13 


0 


S = 1 


1 


1 


1 


0 


0 


0 


0 


1 


1 


1 


1 


0 


0 


0 


0 


B 


[0 


0 


0 


0 


1 


1 


1 


1 


0 


0 


0 


0 


1 


1 


1 


1) 


0 


1 


1 


0 


0 


1 


1 


0 


0 


1 


1 


0 


0 


1 


1 


0 


0 


c 


[0 


0 


1 


1 


0 


0 


1 


1 


0 


0 


1 


I 


0 


0 


1 


1] 


0 


1 


0 


1 


0 


1 


0 


1 


0 


1 


0 


1 


0 


-1 


0 


1 


0 


D 


to 


1 


0 


1 


0 


1 


0 


1 


0 


1 


0 


1 


0 


1 


0 


1] 


0 



The columns of S according to this aspect of the 
invention are the binary representations of the numbers 
15 to 0, The sixteen products of this binary synthesis 
are ABCD, ABC, ABD, AB, ACD, AC, AD, A, BCD, BC, BD, B, 
CD, C, D, and 0 (null) . Also note that each of the 
switch vectors from the four-*step synthesis masks above 
(and hence the synthesis products) are present in the 
four bit binary switch matrix. (See columns 6, 7, 10,. 
and 11) 

This synthesis procedure provides an 
easy way for mapping the completed products. The 
products in the various locations on the substrate- are 
simply defined by the columns of the switch matrix (the 
first column indicating, for example, that the product 
ABCD will be present in the upper left-hand location of 
the substrate) • Furthermore, if only selected desired 
products are to be made, the mask sequence can be derived 
by extracting the columns with the desired sequences. 
For example, to form the product set ABCD, ABD, ACD, 



wo 92/10092 



26 



PCr/US91/08693 



BCD', BD, CD, and D, the masks are formed by use of a 
switch matrix with only the 1st, 3rd, 5th, 7th, 9th, 
llth, 13th, and I5th columns arranged into the switch 
matrix: 
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To form all of the polymers of. length 4, the reactant 
matrix [7VBCDABCDABCDABCD] is used. The switch matrix 
will be formed from a matrix of the binary numbers from 0 
to 2^^ arrsmged in columns. The columns having four 
monomers are then selected and arranged into a switch 
matrix. Therefore, it is seen that the binary switch 
matrix in general will provide a representation of all 
the products which can be made from an n<»step synthesis, 
from which the desired products are then extracted. 

The rows of the binary switch matrix will, in 
preferred embodiments, have the property that each 
masking step illuminates half of the synthesis area. 
Each masking sliep also factors the preceding masking 
stepf that is, half of the region that was illuminated in 
the preceding step is ^gain illuminated, whereas the 
other half is not. Half of the region that was not 
illuminated in the preceding step is also illuminated, 
whereas the other half is not. Thus, masking is 
recursive. The masks are constructed, as described 
previously, by extracting the elements of each row and 
placing them in a square array. For example, the four 
masks in S for a four-step synthesis are: 

mt=llii m2«llll % = 1100 104-1010 

1111 0000 1100 101 0 

0000 1111 1100 1010 

OOOO 00 00 1100 1010 
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The recursive factoring of masks allows tAe 
.products of a light-directed synthesis to be represented 
by a polynomial. (Some light activated syntheses can 
only be denoted by irreducible, i.e., prime polynomials.) 
For example, the polynomial corresponding to the top 
synthesis of Fig. 8a (discussed below) is 

P = (A + B) (C + D) 

A reaction polynomial may be expanded as though it were 
an algebraic expression, provided that the order of 
joining of reactants X, and Xj is preserved (XjXi?* XjX,) , 
i.e., the products are not commutative. The product then 
is AC + AD + BC + BD. The polynomial explicitly 
specifies the reactants and implicitly specifies the mask 
for eacJh step. Each pair of parentheses demarcates a 
round of synthesis. The chemical reactants of a round 
(e.g. , A and B) react at nonoverlapping sites and hence 
cannot combine with one another. The synthesis ar«a is 
divided equally amongst the elements of a round (e.g., A 
is directed to one-half of the area and B to the other 
half). Hence, the masks for a round (e.g., the masks mA 
and %) are orthogonal and form an oriAonormal set. The 
polynomial notation also signifies that each element in a 
round is to be joined to each element of the next round 
(e.g., A with C, A with D, B with C, and B with D) . This 
is accomplished by having mc overlap m„ and equally, 
and likewise for %. Because C and D are elements of a 
round, mc and mo are orthogonal to each other and form an 

orthonormal set. 

The polynomial representation of the binary 
synthesis described above, in which 16 products are made 
from 4 reactants, is 

p » (A + 0) (B + 0) (c + 0) (D + 0) 
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which gives ABCD, ABC, ABD, KB, kCD, AC, AD, A, BCD, BC, 
BD, B, CD, C, D, and 0 when expanded (with the rule that 
26c = X and 20 - X, and remembering that joining is 
ordered). In a binsury synthesis, each round contains one 
reactant and one null (denoted by 0) . Half of the 
synthesis area receives the reactant and the other half 
receives nothing. Each mask overlaps every other mask 
equally. 

Binary rounds and non-*binary rounds can be 
interspersed as desired, as in 

P = (A + 0) (B) (C +-D + 0) (E + P + G) 

The 18 confounds formed are ABCE, ABCF, ABCG, ABDE, ABDF, 
ABD6, ABE, ABF, ABG, BCE, BCF, BC6, BDE, BDF, BD6, BE, 
BF, and B6« The switch matrix S for this 7-step 
synthesis is 
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The roxmd denoted by (B) places B in all products because 
the reaction area was unifoinnly activated (the mask for B 
consisted entirely of l*s). 

The number of compounds k formed in a synthesis 
consisting of r rounds, in which the ith round has b| 
chemical reactants and Z| nulls, is 



k « E <bi+2i> 
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and the nuaiber of chemical steps n Is 

n » Sbj 

The number of compounds synthesized when b » a (the 
number of chemical building blocks) and z = 0 in all 
rounds is a"", compared with 2» for a binary synthesis. 
For n = 20 and a = 5, 625 compounds (all tetramers) would 
be formed, compared with 1.049x10* compounds in a binary 
synthesis with the same n\amber of chemical steps. 

It should also be noted that rounds in a 
polynomial can be nested, as in 

(A + (B40) (C40) ) (D-10) 

The products are AD, BCD, BD, CD, D, A, BC, B, C, and 0. 

Binary syntheses are attractive for two 
reasons. First, they generate the maximal number of 
products (2") for a given number of chemical steps (n) . 
For four reactants, 16 compounds are formed in the binary 
synthesis, whereas only 4 are made when each round has 
two reactants. A lO-step binary synthesis yields 1,024 
compounds, and a 20-step synthesis yields 1,048,576. 
Second, products formed in a binary synthesis are a 
complete nested set with lengths ranging from 0 to n. 
All compotinds that can be formed by deleting one or more 
units from the longest product (the n-mer) are present. 
Contained within the binary set are the smaller sets that 
would be formed from the same reactants using any other 
set of masks (e.g., AC, AD, BC, and BD formed in the 
synthesis shown in Fig. 5 are present in the set of 16 
formed by the binary synthesis). In some cases, however, 
the experimentally achievable spatial resolution may not 
suffice to accommodate all the compounds formed. 
Therefore, practical limitations may require one to 
select a particular subset of the possible switch vectors 
for a given synthesis. 
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1. Example 

Fig. 5 illustrates a synthesis with a binary 
masking strategy. The binary masking strategy provides 
the greatest niunber of secpiences for a given number of 
cycles. According to this embodiment, a mask allows 
illumination of half of the substrate. The substrate is 
then exposed to the building block A, which binds at the 
illuminated regions. 

Thereafter, the mask lOj allows illumination of 
half of the previously illuminated region, while it does 
not illuminate half of the previoxxsly illuminated region. 
The building block B is then added, which binds at the 
illuminated regions: from m,. 

The process continues with masks m,, m4, and m^j, 
resulting in the product array shown in the bottom 
portion of the figure. The process generates 32 (2 
raised to the power of the number of monomers) sequences 
with 5 (the number of monomers) cycles. 

2* Example 

Fig. 6 illustrates another preferred binary 
masking strategy which is: referred to herein as the gray 
code masking strategy. According to this embodiment, the 
masks m, to ms are selected such that a side of any given 
synthesis region is defined by the edge of only one mask. 
The site at which the sequence BCDB is formed, for 
example, has its right edge defined by ms and its left 
side formed by mask 11H4 (and no other mask is aligned on 
the sides of this site) . Accordingly, problems created 
by misalignment, diffusion of light under the mask and 
the like will be minimized. 

3. Example 

Fig. 7 illustrates another binary masking 
strategy. According to this scheme, refexrred .to herein 
as a modified gray code masking strategy, the number of 
masks needed is minimized. For example, the mask m, could 
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be the same mask as ml and simply translated latttally. 
Similarly, the mask m4 could be the same as mask m3 and 
simply translated laterally. 

4. Example 

A four-step synthesis is shown in Fig. 8a. The 
reactants are the ordered set {A,B,C,D>« In the first 
cycle, illumination through m, activates the upper half of 
the synthesis area. Building block A is then added to 
give the distribution 602. Illumination through mask mj 
(which activates the lower half) , followed by addition of 
B yields the next intermediate distribution 604 . C is 
added after illumination through (which activates the 
left half) giving the distribution 604, and D after 
illumination through m4 (which activates the right half), 
to yield the final product pattern 608 {AC,AD,BC,BD}. 

5. Exaytp^e 

The above masking strategy for the synthesis 
may be extended for all 400 dipeptides from the 20 
naturally occurring amino acids as shown in Fig. Bb. The 
synthesis consists of two rounds, with 20 photolysis and 
chemical coupling cycles per round. In the first cycle 
of round 1, mask 1 activates l/20th of the substrate for 
coupling with the first of 20 amino acids. Nineteen 
subsequent illumination/coupling cycles in round 1 yield 
a substrate consisting of 20 rectangular stripes each 
bearing a distinct member of the 20 amino acids. The 
masks of round 2 are perpendicular to round 1 masks and 
therefore a single illumination/ coupling cycle in round 2 
yields 20 dipeptides. The 20 illumination/coupling 
cycles of round 2 complete the synthesis of the 400 
dipeptides. 

6. sample 

The power of the binary masking strategy can be 
appreciated by the outcome of a lO-step synthesis that 
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produced 1,024 pep€ides« The polynomial expression for. 
this 10-step binary synthesis was: 

{f-10) (Y-f0) (G40) (A40) (G40) (T40) (F-10) (LH0) {S40) (F40) 

Each peptide occupied a 400x400 /xm square. A 
32x32 peptide array (1,024 peptides, including the null 
peptide and 10 peptides of £ = 1, and a limited number of 
duplicates) was clearly evident in a fluorescence scan 
following side group deprotection and treatunent with the 
antibody 3E7 and f luoresceinated antibody. Each 
synthesis site was a 400x400 ^m square. 

The scan showed a range of fluorescence 
intensities, from a background value of 3,300 counts to 
22,400 counts in the brightest square (x « 20, y = 9) . 
Only IS compounds exhibited an intensity greater than 
12,300 counts* The median value of the array was 4 #800 
counts. 

The identity, of each peptide in the array could 
be determined from its x and y coordinates (each range 
from 0 to 31) and the map of Fig. 9. The chemical units 
at positions 2, 5, 6, 9, and 10 are specified by the y 
coordinate and those at positions 1, 3, 4, 7 it 8 by the x 
coordinate. All but one of the peptides was shorter than 
10 residues. Fdr example, the peptide at x =^ 12 and 
y 5= 3 is YGAGF (SEQ ID NOtS; positions 1, 6, 8, 9, and 10 
are nulls). YGAFLS (SEQ ID Na:4>, the brightest element 
of the array, is at x = 20 and y « 9. 

It is often desirable to deduce a binding 
affinity of a given peptide from the. measured 
fluorescence intensity.. Conceptually, the simplest case 
is one in which a single peptide binds to a univalent 
antibody molec\ile. The fluorescence sccm is carried out 
after the slide is washed with buffer for a defined time. 
The order of fluorescence intensities is then a measure 
primarily of the relative dissociation rates of the 
antibodypeptide complexes, rf the on~rate constants are 
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the sane (e.g», if they are dif f uslon^controlled) , the 
order of fluorescence intensities will typically 
correspond to the order of binding affinities. However, 
the situation is sometimes more complex because a 
bivalent primary antibody and a bivalent secondary 
antibody are used. The density of peptides in a 
synthesis area corresponded to a mean separation of 
-7 nm, which would allow multivalent antibody-peptide 
interactions. Hence, fluorescence intensities obtained 
according to the method herein will often be a 
qualitative indicator of binding affinity. 

Another important consideration is the fidelity 
of synthesis. Deletions are produced by incomplete 
photodeprotection or incomplete coupling. The coupling 
yield per cycle in these experiments is typically between 
85% and 95%. Implementing the switch matrix by masking 
is imperfect because of light diffraction, internal 
reflection, and scattering, consequently, stowaways 
(chemical tinits that should not be on board) arise by 
tinintended illumination of regions that should be dark. 
A binary synthesis array contains many of the controls 
needed to assess the fidelity of a synthesis. For 
example, the fluorescence signal from a synthesis area 
nominally containing a tetr^peptide ABCD could come from 
a tripeptide deletion impurity such as ACD. Such an 
artifact would be ruled out by the finding that the 
fluorescence intensity of the ACD site is less than that 
of the ABCD site. 

The fifteen most highly fluorescent peptides in 
the array obtained with the synthesis of 1,024 peptides 
described above, were Y6AFLS (SEQ ID N0:4), yCAFS (SEQ ID 
N0:5), Y6AFL (SEQ ID N0:6), YG6FLS (SEQ ID N0:7), YGAF 
(SEQ ID N0:8), YGALS (SEQ ID N0:9), YGGFS (SEQ ID NO:10), 
YGAL (SEQ ID NO: 11), YGAFLF (SEQ ID NO: 12), YGAF (SEQ ID 
N0:8), YGAFF (SEQ ID NO:13), YGGLS (SEQ ID NO: 14), YGGFL 
(SEQ ID NO:l and SEQ ID N0:15), YGAFSF (SEQ ID N0:16), 
and Y6AFLSF (SEQ ID NO: 17) . A striking feature is that 
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all fifteen begin with Y6, which agrees with previous 
work showing that an aiftino-tarminal tyrosine is a key 
determinsuit of binding to 3£7, Residue 3 of this set is 
either A or G, and residue 4 is either F or L. The 
exclusion of S and T from these positions is clear cut. 
The finding that the preferred sequence is YG (A/G) (F/L) 
fits nicely with the outcome of a study in which a very - 
large librairy of peptides on phage generated by 
recombinant DNA methods was screened for binding to 
antibody 3E7 (see Cwirla et al, , Proc, Watl. Acad> Sci. 
USA , (1990> 57:6378^ incorporated herein by reference). 
Additional, binary syntheses based on leads from peptides 
on phage experiments show that YGAFMQ (S£Q ID HO: 18) , 
YGAFH (SBQ ID NO:i9), and Y6AFQ (SEQ ID N0:20) give 
Stronger fluorescence signals than does YGGFH (SEQ ID 
K0:21}, the immunogen used to obtain antibody 3S7. 

Variations on the above masking strategy will 
be valuable in certain circumstances. For example, if a 
^'kernel** secpience of interest consists of PQR separated 
from XYZ, the aim is to synthesize peptides in which 
these units are sepsurated by a variable number of 
different residues. The kernel can be placed in each 
peptide by using a mask that has 1*8 everywhere. The 
polynomial representation of a suitable synthesis is: 

(P) (tt) (R) (A-J0) (B-H3) (C-K3) (D40)i (X) (Y) (Z) 

Sixteen peptides will be formed, ranging in length , from 
the 6-mer PQRXYZ to the 10-mer PQRABCDXYZ. 

Several other masking strategies will also find 
value in selected circumstances. By using a particular 
mask more than once, two or more reactants will appear in 
the same set of products. For example, suppose that the 
mask for an 8 -step synthesis is 

A 11110000 
B 00001111 
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C IlOOllOO 

D 00110011 

£ 10101010 

F 01010101 

G 11110000 

H 00001111 



The products are ACEG, ACFG^ ADEG, ADF6, BCEH, 
BCFH, BDEH, and BDFH. A and G always appear in the sane 
product, although not necessarily next to each other, 
because their additions were directed by the same mask, 
and likewise for B and 

C. ^inHer Selection 

According to preferred embodiments the linker 
molecules used as an intermediary between the synthesized 
polymers and the substrate are selected for optimum 
length and/or type for improved binding interaction with 
a receptor. According to this aspect of the invention 
diverse linkers of varying length and/ or type are 
synthesized for sxxbsequent attachment of a ligand. 
Through variations in the length and type of linker, it 
becomes possible to optimize the binding interaction 
between an immobilized ligand and its receptor. 

The degree of binding between a ligand 
(peptide, inhibitor, hapten, drug, etc.) and its receptor 
(enzyme, antibody, etc.) when one of the partners is 
immobilized on to a substrate will in some embodiments 
depend on the accessibility of the receptor in solution 
to the immobilized ligand. The accessibility in turn 
will depend, on the length and/ or type of linker molecule 
employed to immobilize one of the partners. Preferred 
embodiments of the invention therefore employ the VIiSIPS 
technology described herein to generate an array of, 
preferably, inactive or inert linkers of varying length 
and/ or type, using. photochemical protecting groups to 
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selectively expose different regions of the substrate and 
to build upon che]iiically->*actlve groups • 

In the simplest einbodiment of this concept, 
the same unit Is attached to the substrate In varying 
multiples or lengths In known locations on the substirate 
via VLSIP& techniques to generate an array of polymers of 
varying length. A single llgand (peptide, drug, hapten, 
etc.) is attached to each of them, and an assay is 
performed with the binding site to evaluate the degree of 
binding with a receptor that is known to bind to the 
llgand. In cases where the liiiJcer length Impacts the 
ability of the receptor to bind to the llgand, vaurying 
levels of binding will be observed. In general, the 
linker which provides the highest binding will then be 
used to assay other llgands synthesized in accordance 
with the techniques herein. 

According to other embodiments the binding 
between a single llgand/receptor pair is evaluated for 
linkers of diverse monomer sec[uence* According to these 
embodiments, the linkers are synthesized in an array in 
accordance with the techniques herein and have different 
monomer sequences (and, optionally, different lengths) . 
ddiereafter, all of the linker molecules are provided with 
a llgand known to have at least some binding affinity for 
a given receptor. The given receptor is then exposed to 
the llgand and binding affinity is deduced. Linker 
molecules which provide adequate binding between the 
llgand and receptor, are then utilized in screening 
studies. 

D. Protecting Groups 

As discussed above, selectively removable 
protecting groups allow creation of well defined areas of 
substrate surface having differing reactivities. 
Preferably, the protecting groups are selectively removed 
from the surface by applying a specific activator, such . 
as electromagnetic radiation of a specific wavelength and 
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inlfenslty. More preferably, the specific activator 
exposes selected areas of the surface to remove the 
protecting groups in the exposed areas. 

Protecting groups of the present invention are 
used in conjunction with solid phase oligomer syntheses, 
such as peptide syntheses using natural or unnatural 
amino acids, nucleotide syntheses using deoxyribonucleic 
and ribonucleic acids, oligosaccharide syntheses, and the 
like. In addition to protecting the substrate surface 
from unwanted reaction, the protecting groups block a 
reactive end of the monomer to prevent 
self-polymerization. For instance, attachment of a 
protecting group to the amino terminus of an activated 
amino acid, such as an N^hydroxysuccinimide-activated 
ester of the amino acid, prevents the amino terminus of 
one monomer from reacting with the activated ester 
portion of another during peptide synthesis. 
Alternatively r the protecting group may be attached to 
the carboxyl group of an amino acid to prevent reaction 
at this site. Most protecting groups can be attached to 
either the amino or the carboxyl group of an amino acid, 
and the nature of the chemical synthesis will dictate 
which reactive group will require a protecting group. 
Analogously, attachment of a protecting group to the 
5*-hydroxyl group of a nucleoside during synthesis using 
for example, phosphate-triester coupling chemistry, 
prevents the 5*-hydroxyl of one nucleoside from reacting 
with, the 3 « --activated phosphate-triester oC another* 
Regardless of the specific use, protecting 
groups are employed to protect a moiety on a molecule 
from reacting with another reagent. Protecting groups of 
the present invention have the following characteristics: 
they prevent selected reagents from modifying the group 
to which they are attached; they are stable (that is, 
they remain attached to the molecule) to the synthesis 
reaction conditions; they are removable under conditions 
that do not adversely affect the remaining structure; and 
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once removed, ^ey do not react appreciably with the 
surface or surface-bound, oligomer* The selection of a 
suitable protecting group will depend, of course, on the 
chemical nature of the monomer unit and oligomer, as. veil 
as the specific reagents they are to protect against. 

In a preferred embodiment, the protecting 
groups are photoactivatable.. The properties and uses of 
photoreactive protecting compounds have been reviewed. 
See, HcCray e£ Sl« # Ann. Rev, of Biophvs> and Bionhvs. 
C!hem- (1989) 18:239<*270, which is inccScporated herein by 
reference. Preferably, the photosensitive protecting 
groups will be removable by radiation in the ultraviolet 
(UV) or visible portion of the electromagnetic spectrum. 
More. preferably, the protecting groups will be removable 
by radiation in the near UV or visible portion of the 
spectrum. In some embodiments, however, activation, may 
be performed by other methods such as localized' heating, 
electron beam lithogzraphy, laser pumping, oxidation or 
reduction with microelectrodes, and the like. Sulfonyl 
compounds are suitable reactive groups for electron beam 
lithography, oxidative or reductive removal is 
accomplished by exposure of the protecting group to an 
electric current source, preferably tising microelectrodes 
directed to the predefined regions of the surface whicdi 
are desired for activation. Other method? may be used in 
light of this disclosure. 

Many, although not all, of the photoremovable 
protecting groups will be aromatic compounds that absorb 
near*nv and visible radiation. Suitable photoremovable 
protecting groups jare described in, for example, Mccray 
et^., Patchomik, J. Amer. Chem. Soc. (197a) 92:6333, 
and Amit sfe al. # J> Org. Chem. (1974) 3£:192, which are 
incorporated herein by reference. 

A preferred class of photoremovable protecting 
groups has the general formula: 



1 
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where R^, R^r and R* independently are a hydrogen atom, 
a lower alkyl, aryl, benzyl, halogen, hydroxyl, alkoxyl, 
thiol, thioether, amino, nitro, carboxyl, formate, 
formamido or phosphido group, or adjacent substituents 
(i.e., R^-R^r R^-R^f R^-R*) are substituted oxygen groups 
that together form a cyclic acetal or ketal; R^ is a 
hydrogen atom, a alkoxyl, alkyl, halo, aryl, or alkenyl 
group, and n « 0 or 1. 

A preferred protecting group, 6-nitroveratryl 
(NV) , which is used for protecting the carboxyl terminus 
of an amino acid or the hydroxyl group of a nucleotide, 
for example, is formed when R^ and R^ are each a methoxy 
group, R^, R"* and R^ are each a hydrogen atom, and n = 0: 




A preferred protecting group, 
6-nitroveratryloxycarbonyl (HVOC), iimich is used to 
protect the amino tenainus of an amino acid, for example, 
is formed when R^ and R^ are each a methoxy group, R^, R* 
and R^ are each a hydrogen atom, and n - l: 



O NO2 



OMe 

OMe 
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Another preferred protecting group , 
6-nitropiperonyl (NP) , which is used for protecting the 
carboxyl terminus of an amino acid or the hydroxyl group 
of a nucleotide, for example, is formed when and V? 
together form a methylene acetal, R^,. and R^ are each a 
hydrogen atom, and n » 0: 




Another. preferred protecting group, 
6-nitropiperonyloxycarbonyl (NFOC) , which is used to 
protect the amino terminus of an amino acid, for example, 
is formed when R^ and R^ together form a methylene acetal, 
R^, R"* and R^ are each a hydrogen atom, and n » 1: 




A most preferred protecting group, 
methyl-»6-nitroveratryl (MeW), which is used for 
protecting the carisoacyl terminus of an amino acid or the 
hydroxyl group of a nucleotide, for example, is formed 
when R^ and R^ are each a methoxy group, R^ and R^ are .. 
each a hydrogen atom, R^ is a methyl group, and n = O: 
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Another most preferred protecting group, 
inethyl-6-nitroveratryloxycarbonyl (MeNVOC) , which is used 
to protect the amino terminus of an amino acid, for 
example, is formed when and are each a methoxy 
group, R^ and R^ are each a hydrogen atom, R^ is a methyl 
group, and n = 1: 




OMe 



Another most preferred protecting group, 
methyl-6-nitropiperonyl (MeNP) , which is used for 
protecting the carboxyl terminus of an amino acid or the 
hydroxy 1 group of a nucleotide, for example, is formed 
when R^ and R^ together form a methylene acetal, R^ and R* 
are each a hydrogen atom, R* is a methyl group, and n « 0: 




Another most preferred protecting group, 
methyl-6-nitropiperonyloxycarbonyl (MeMPOC) , which is 
used to protect the amino terminus of an amino acid or to 
protect te 5' hydroxyl of nucleosides, for example, is 
formed when R^ and R^ together form a methylene acetal, R^ 
and R^ are each a hydrogen atom, R^ is a methyl group, and 
n = 1: 
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A protected amino acid having a 
photoactivatable oxycarbonyl protecting group, such NVOC 
or NPOC or their corresponding methyl derivatives, MeNVOC 
or HeNFOC, respectively, on the amino terminus is formed 
by acylating the amine of the amino acid with an 
activated oxycarbonyl ester of the protecting group. 
Examples of activated os^carbonyl esters of NVOC and 
HeNVOC have the general formula: 




OMe OMe 
NVOC-X MeNVOC-X 



where X is halogen, mixed anhydride, phenoxy, 
p-nitropheno^, N-hydroxysuccinimide, and the like* 

A protected amino acid or nucleotide having a 
photoactivatable protecting group, such as NV or NP or 
their corresponding methyl derivatives, MeNV or HeNP, 
respectively, on the carboxy terminus of the amino acid 
or 5* -hydroxy terminus of the nucleotide, is formed by 
acylating the carboxy terminus or 5*H3H with an activated 
benzyl derivative of the protecting group. Examples of 
activated benzyl derivatives of MeNV and HeNP have the 
general formula: 




MeNV-X 



MeNP-X 
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where X is halogen, hydroxy 1, tosyl, mesylf 
trifluoromethylr diazo, azido, and the like. 

Another method for generating protected 
monomers is to react the bensylic alcohol derivative of 
the protecting group with an activated ester of the 
monomer. For example, to protect the cstrboxyl terminus 
of an amino acid, an activated ester of the amino acid is 
reacted with the alcohol derivative of the protecting 
group, such as 6-nitroveratrol (NVOH) . Examples of 
activated esters suitable for such uses include 
halo'formate, mixed anhydride, imidazoyl formate, acyl 
halide, and also include formation of the activated ester 
in situ the use of common reagents such as DCC and the 
like. See Atherton si- tor other examples of 
activated esters. 

A further method for generating protected 
monomers is to react the benzylic alcohol derivative of 
the protecting group with an activated carbon of the 
monomer. For example, to protect the 5*-hydroxyl group 
of a nucleic acid, a derivative having a 5* -activated 
carbon is reacted with the alcohol derivative of the 
protecting group, such as methyl-6-nitropiperonol 
(MePyROH) . Examples of nucleotides having activating 
groups attached to the 5* -hydroxy 1 group have the general 
formula; 

OP 



where Y is a halogen atom, a tosyl, mesyl, 
trifluoromethyl, azido, or diazo group, and the like. 

Another class of preferred photochemical 
protecting groups has the formula: 
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Where R^, R^, and R^ independently are a hydrogen atom, a 
lower alJcyl, aryl, benzyl, halogen, hydroxyl, alk03^1, 
thiol, thioether, amino, nitro, carhoxyl, formate, 
foxmamido, salfanates, sulfide or phosphido group, R^ and 
r' independently are a hydrogen atom, an alkoxy, al]^l, 
halo, aryl, or alkenyl group, and n » Q or 1. 

A preferred protecting group, 
1-pyrenylmethyloxycarbonyl (PyROC) , which is used to 
protect the amino terminus of 2m amino acid, for example, 
id formed when R^ through R^ are each a hydrogen atom and 
n « 1: 




Another preferred protecting grovp, 
1-pyrezxylBiethyl (PyR) , which is used for protecting the 
carbosqr terminus of an amino acid or the hydroxyl group 
of a nucleotide, for example, is formed when R^ through R* 
are each a hydrogen atom and n « O: 
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An amino acid having a pyrenylmethyloxycarbonyl 
protecting group on its amino terminus is formed by 
acylation of the free amine of amino acid with an 
activated oxycarbonyl ester of the pyrenyl protecting 
group. Examples of activated oxycarbonyl esters of PyROC 
have the general formula: 




where X is halogen, or mixed anhydride, p-nitrophenoxy, 
or N-hydroxysuccinimide group, and the like. 

A protected amino acid or nucleotide having a 
photoactivatable protecting group, such as PyR, on the 
carboxy terminus of the amino acid or 5 '-hydroxy terminus 
of the nucleic acid, respectively, is formed by acylating 
the carboxy terminuis or 5**K)H with an activated 
pyrenylmethyl derivative of the protecting group. 
Examples of activated pyrenylmethyl derivatives of PyROC 
have the general formula: 




where X is a halogen atom, a hydroxyl, diazo, or azido 
group, and the like. 

Another method of generating protected monomers 
is to react the pyrenylmethyl alcohol moiety of the 
protecting group with an activated ester of the monomer. 
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For exan^ler an activated ester of an amino acid can be 
reacted with the alcohol .derivative of the protecting 
group, such as pyrenylmethyl alcohol (PyROH) , to form the 
protected derivative of the carboxy terminus of the amino 
acid. Examples of activated esters include halo* formate, 
mixed anhydride, imidazoyl formate, acyl halide, and also 
Include formation of the activated ester In situ and the 
use of common reagents such as OCC and the like. 

Clearly, many photosensitive protecting groups 
are suitable for use in the present invention. 

In preferred embodiments, the siibstrate is 
irradiated to remove the photoremovable protecting groups 
and create regions having free reactive moieties and side 
products resulting from the protecting group. The 
removal rate of the protecting groups depends on the 
wavelength and intensity of the incident radiation, as 
well as the physical and chemical properties of the 
protecting group itself. Preferred protecting groups are 
removed at a faster rate and with a lower intensity of 
radiation. For exaiaple, at a given set of conditions, 
HeMVOC and HeNPOC are photolytically removed from the 
N-terminus of a peptide chain faster than their 
unsubstituted parent con^unds, NVOC and UPOC, 
respectively • 

Removal of the protecting group is accomplished 
by irradiation to separate the reactive group and the 
degradation products derived from the protecting group. 
Not wishing to be boxmd by theory, it is believed that 
irradiation of an NVOC- and HeNVOC-protected oligomers 
occurs by the following reaction schemes: 

NVOC-AA -> 3,4-dimethoxy-6-nitrosobenzaldehyde + CO^ + AA 
HeNVOC--AA*> 3 , 4-dlmethoxy-6-nitrosoacetophenone + CO^ + AA 

where AA represents the H-termln\is of the amino acid 
oligomer. 
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Along with the unprotected amino acid, other 
products are liberated into solution: carbon dioxide and 
a 2,3-diinethoxy-6-nitrosophenylcarbonyl compound, which 
can react with nucleophilic portions of the oligomer to 
form unwanted secondary reactions* In the case of an 
KV0C-*protected amino acid, the degradation product is a 
nitrosobenzaldehyde, while the degradation product for 
the other is a nitrosophenyl ketone* For instance, it is 
believed that the product aldehyde from NVOC degradation 
reacts with free amines to form a Schiff base (imine) 
that affects the remaining polymer synthesis* Preferred 
photoremovable protecting groups react slowly or 
reversibly with the oligomer on the suppoxrt. 

Again not wishing to be bound by theory, it is 
believed that the product ketone from irradiation of a 
HeNVOC-protected oligomer reacts at a slower rate with 
nucleophiles on the oligomer than the product aldehydes 
from irradiation of the same NVOC-protected oligomer* 
Although not unambiguously determined, it is believed 
that this difference in reaction rate is due to the 
difference in general reactivity between aldehydes and 
ketones towards nucleophiles due to steric and electronic 
effects* 

The photoremovable protecting groups of the 
present invention are readily removed. For exaunple, the 
photolysis of Nonprotected ]>phenylalanine in solution 
having different photoremovable protecting groups was 
analyzed, and the results are presented in the following 
table: 

Ta^ble 

Photolvsis of Protected L->Phe->QH 



solvent HIQS HSeS MgHSQC MeNfOp 

Dioxane 1288 110 24 19 

5mH HjSOyDioxane 1575 98 33 22 
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The half life,r t^^^ is the time in seconds 
required to remove 50% of the starting amount of 
protecting group. HBOC is the 6-nits:oben2ylo^carbonyl 
group, NVOC is the 6*nitrdveratryloxycarbonyl group, 
MeNVOC is the aethyl«6«=nitroveratrylo3cycarbonyl group, 
and MeNPOC is the inethyl-6-nitropiperonyloxycarbonyl 
group « The photolysis was carried out in the indicated 
solvent with 362/364 nm«=-imvelength irradiation having an 
intensity of 10 WS/ca?, and the concentration of each 
protected phenylalanine was 0.10 nS, 

The table shows that deprotection of NV0C-, 
iieNVOC-, and HeHMC-protected phenylalanine proceeded 
faster than the deprotection of WBOCo Furthermore, it 
shows that the deprotection of the two derivatives that 
are substituted on the benzylic carbon, £SeS9V0C and 
MeNPOC, were photolysed at the highest rates in both 
dioxane and acidified dioxane. 

1. Ose of Photoreaov able Groups Pur incr 
Solid-^Phase Svnthesis of Peptide 

The fosmation of peptides on a solid«»phase 

support requires the st^iwise attachment of an amino acid 

to a substrate-*bound growing chain. In order to prevent 

unwemted polymerization of the monomieric amino acid under 

the reaction conditions, protection of the amino terminus 

of the amino acid is required* Mter the monomer is 

coupled to the end of the peptide, the H»terminal 

protecting group is removed, and another amdLno acid is 

coupled to the chain, rails cycle of coupling and 

deprotecting is continued for each amino acid in the 

peptide sequence. See Merrifield, J, Am. Chen, sor^. 

(1963) 85S2149, and Atherton sfe Si-, "'Solid Phase 

Peptide Synthesis^v 1989, IHL Press, London, both 

incorporated herein by reference for all purposes » As 

described above, the use of a photoremovable protecting 

group allows removal of selected portions of the 
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substrate surface^ via patterned irradiation , during the 
deprotection cycle of the solid phase synthesis. This 
selectively allows spatial control of the synthesis—the 
next amino acid is coupled only to the irradiated areas. 

In one eoibodiaient, the photoremovable 
protecting groups of the present invention are attached 
to an activated ester of an amino acid at the amino 
terminus: 



Y' 



NK-X 



where R is the side chain of a natural or unnatural amino 
acid, X is a photoremovable protecting group, and Y is an 
activated carboxylic acid derivative, The photoremovable 
protecting group, X, is preferably NVOC, NPOC, PyROC, 
MeMVOC, HeNPOC, and the like as discussed above. The 
activated ester, Y, is preferably a reactive derivative 
having a high coupling efficiency, such as an acyl 
halide, mixed anhydride, N-hydroxysuccinimide ester, 
perfluorophenyl ester, or urethane protected acid, and 
the like. Other activated esters and reaction conditions 
are well known (See Atherton s& al. ) • 

2. wse of Phot oremovable Groups During 

fiftiid^Pha ae Synthesis of Oliaonucleotidea 

The formation of oligonucleotides on a 

solid-phase support requires the stepwise attacEhment of a 

nucleotide to a substrate-bound growing oligomer. In 

order to prevent unwanted polymerization of the monomeric 

nucleotide under the reaction conditions, protection of 

the 5»-hydroxyl group of the nucleotide is required. 

After the monomer is coupled to the end of the oligomer, 

the 5«-hydroxyl protecting group is removed, and another 

nucleotide is coupled to the chain. This cycle of 
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coupling and deprotecting is continued for each 
nucleotide in the oligomer sequence* See Gait, 
"Oligonucleotide Synthesis: A Practical Approach" 1984, 
IRL Press, London, incorporated herein by reference for 
all purposes. As described above, the use of a 
photorenovable protecting group allows removal, via 
patterned irradiation, of selected portions of the 
substrate surface during the deprotection cycle of the 
solid phase synthesis* This selectively allows spatial 
control of the synthesis — the next nucleotide is coupled 
only to the irradiated areas. 

Oligonucleotide synthesis generally involves 
coupling an activated phosphorous derivative on the 
3»-hydroxyl group of a nucleotide with the 5 ■ -hydroaq^l- 
group of an oligomer bound to a solid support. Two major 
chemical methods exist to perform this coupling: the 
phosphate-tries ter and phosphoramidite methods (See 
Gait] • Protecting groups of the present invention are 
suitable for use in either method. 

In a preferred embodiment, a photoremovable 
protecting group is attached to an activated nucleotide 
oil the 5*-hydro3^1 group: 




where B is the base attached to the sugar ring.; R is a 
hydrogen atom when the sugar is deoxyribose or R is a 
hydroxyl group when the sugar is ribose; P represents an 
activated phosphorous group; and X is a photoremovable 
protecting group. The photoremovable protecting group, 
X, is preferably NV, NP, PyR, MeNV, MeMP, NVOC, NPOC, 
PyROC, HeNVOC, MeNPOC, and the like as described above. 
The activated phosphorous group, P, is preferably a 
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reactive derivative having a high coupling efficiency, 
such as a phosphate-triester, phosphoramidite or the 
like. Other activated phosphorous derivatives, as well 
as reaction conditions, are well known (See Gait) . 
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E. Amino Acid N^Carboxv Anhydrides 

Protecrfced With a Photoremovable Group 

During Merrifield peptide synthesis, an 
activated ester of one amino acid is coupled with the 
free amino terminus of a substrates-bound oligomer. 
Activated esters of amino acids suitable for the solid 
phase synthesis include halo^formate, mixed anhydride, 
imidazoyl formate, acyl halide, and also includes 
formation of the activated ester in situ and the 
use of common reagents such as DCC and the like 
(See Atherton et al.) • A preferred protected and 
activated amino acid has the general formula: 




where R is the side chain of the amino acid and' X is a 
photoremovable protecting group. ISiis compound is a 
urethane-protected amino acid having a photoremovable 
protecting group attached to the amine* A more preferred 
activated amino acid is formed when the photoremovable 
protecting group has the general formula: 




where R^, R^, R^, and R^ independently are a hydrogen atom, 
a lower alkyl, aryl, benzyl, halogen, hydroxy 1, alkoxyl, 
thiol, thioether, amino, nitro, carboxyl, formate, 
formamido or phosphide grot^, or adjacent substituents 
(i.e.^ R^-R^, R^*R^, R^-R*) are substituted oa^en groups 
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that together form a cyclic acetal or ketal; and V? is a 
hydrogen atom, alkoxyl, alkyl, halo, aryl, or alkenyl 
group. 

A preferred activated amino acid is 
formed when the photoremovable protecting group is 
6-nitroveratryloxycarbonyl. That is, and are each a 
hydrogen atom, and r' are each a methoxy group, and R* 
is a hydrogen atom. Another preferred activated amino 
acid is formed ^en the photoremovable group is 
6-nitropiperonyl: R* and R* are each a hydrogen atom, R* 
and together form a methylene acetal, and R* is a 
hydrogen atom. Other protecting groups are possible. 
Another preferred activated ester is formed when the 
photoremovable group is methyl-6-nitroveratryl or methyl- 
6-nitropiperonyl . 

Another preferred activated amino acid is 
forined when the photoremovable protecting group has the 
general formula t 




where R*, r', and r' independently are a hydrogen atom, a 
lower alkyl, aryl, benzyl, halogen, hydroxy 1, alkoxylr 
thiol, thioether, amino, nitro, carboxyl, formate, 
formamido, sulfanate, sulfide or phosphide group, and R^ 
and r' independently are a hydrogen atom, an alkoxy, 
alkyl, halo, aryl, or alkenyl group. The resulting 
eoapoond is a urethane-protected amino acid having a 
pyrenylmethyloxycarbonyl protecting group attached to the 
amine. A more preferred embodiment is formed when R^ 
through R* are each a hydrogen atom. 

The urethane-protected asdno acids having a 
^otoremovable protecting group of the present invention 
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are' prepared by condensation of an M-protected aiaino acid 
with an acylating agent such as an acyl halide, 
anhydride, chlorof ormate and the like (See Fuller et al. , 
U.S. Patent No. 4,946,942 and Fuller et al • , J. Amer. 
Chem. Soc. (1990) 112 ; 7414-7416 ^ both herein incorporated 
by reference for all purposes) » 

Urethane-protected amino acids having 
photoremoveJ^le protecting groups are generally useful as 
reagents during solid-phase peptide synthesis, and 
because of the spatial selectivity possible with the 
photoremovable protecting groups, are especially \iseful 
for the spatially addressing peptide synthesis These 
aioino acids are difunctional: the urethane group first 
serves to activate the carboxy terminus for reaction with 
the amine bound to the surface, and, once the peptide 
bond is formed, the photoremovable protecting group 
protects the newly formed amino terminus from further 
reaction. These amino acids are also highly reactive to 
nucleophiles, such as deprotected amines on the surface 
of the solid support, and due to this high reactivity, 
the solid-phase peptide coupling times are significantly 
reduced, euid yields are typically higher. 

IV. p^t^ coi;i,ectton 

A. Data Collection System 

Substrates prepared in accordance with the 
above description are used in one embodiment to determine 
which of the plurality of sequences thereon bind to a 
receptor of interest. Fig. 10 illustrates one embodiment 
of a device used to detect' regions of a substrate which 
contain florescent markers. This device would be used, -* 
for example, to detect the presence or absence of a 
fluorescently labeled receptor such as an antibody which 
has bound to a synthesized polymer on a substrate. 

Light is directed at the substrate from a light 
source 1002 such as a laser light source of the type well 
known to those of skill in the art such as a model no. 
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2025 made by Spectra Physics. Light from the source is 
directed at a lens 1004 which is preferably a cylindrical 
lens of the type well known to those of slcill in the art. 
The resulting output from the lens 1004 is a linear beam 
rather than a spot of light. Thus, data can be detected 
substantially simultaneously along a linear array of 
pixels rather than on a pixel-by-pixel basis. It will be 
understood that while a cylindrical lens is used herein 
as an illustration of one technique for generating a 
linear beam of light on a surface, other techniques could 
also be utilized. 

The beam from the cylindrical lens is passed 
through a dichroic mirror or prism and directed at the 
smrface of the suitably {>repared substrate 1008. 
Substrate 1008 is placed on an x-y translation stage 1009 
such as a model no. FH500-8 made by Newport. Certain 
locations on the substrate will fluoresce and 
fluorescence will be transmitted along the path indicated 
by dashed lines back through the dichroic mirror, and 
focused with a suitable lens 1010 such as an f/1.4 camera 
lens on a linear detector 1012 via a variable f stop 
focusing lens 1014. Through use of a linear light beam, 
it becomes possible to generate data over a line of 
pixels (such as about 1 cm) along the substrate, rather 
than from individual points on the substrate. In 
alternative embodiments, light Is directed at a 2- 
dimensional area of the substrate and fluorescence is 
detected by a 2-dimensional CC3> array. Linear detection 
is preferred because substantially higher power densities 
are obtained* 

Detector 1012 detects the amount of 
fluorescence emitted from the substrate as a function of 
position. According to one embodiment the detector is a 
linear CCD array of the type commonly known to those of 
skill in the art. The x-y translation stage, the light 
source, and the detector 1012 are all operably connected 
to a computer 1016 such as an IBM PC-AT or equivalent for 
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control of the device and data collection from the CCD 
array. 

In operation^ the substrate is appropriately 
positioned by the translation stage. The light source is. 
then illuminated, and fluorescence intensity data are 
gathered with the computer via the detector. 

In an alternate embodiment, the substrate and 
x/y translation table are placed under a microscope which 
includes one or more objectives. Light (about 488 nm) 
from a laser, which in some embodiments is a model no. 
2020-^05 argon ion laser manufactured by Spectraphysics, 
is directed at the substrate by a dichroic mirror which 
passes greater than about 520 nm light but reflects 
488 nm light. The dichroic mirror may be, for example, a 
model no. FTSio manufactured by Carl Zeiss. Light 
reflected from the mirror then enters the microscope 
which may be, for example, a model.no. Axloscop 20 
manufactured by Carl Zeiss. Fluorescein-marked materials 
on the substrate will fluoresce >488 nm light, and the 
fluoresced light will be collected by the microscope and 
passed through the mirror. The fluorescent light from 
the substrate is then directed through a wavelength 
filter and, thereafter" through an aperture plate. The 
wavelength filter may be, for example, a model no. OG530 
manufactured by Melles Griot and the aperture plate may 
be, for example, a model no. 477352/477380 manufactured 
by Carl Zeiss. 

The fluoresced li^t then enters a 
photomultiplier tube which in some embodiments is a model 
no. R943-02 manufactured by Hamamatsu, the signal is 
amplified in a preamplifier and photons are counted by a 
photon counter. The number of photons is recorded as a 
function of the location in the computer. The pre-amp 
may be, for example, a model no. SR440 manufacttired by 
Stanford Research Systems and the photon counter may be a 
model no. SR400 manufactured by Stanford Research 
Systems. The substrate is then moved to a subsequent 



wo 92/10092 



57 



PCr/US91/08693 



location and the process is repeated* In preferred 
embodiments the data are . acquired every 1 to 100 with 
a data collection diameter of about 0*8 to 10 |im 
preferred. In embodiments with sufficiently high 
fluorescence, a CCD detector with broadf ield illumination 

is utilized. 

Fig. 11 illustrates the architecture of the 
data collection system in greater detail* Operation of 
the system occurs under the direction of the photon 
counting program 1102. The user inputs the scan 
dimensions, the number of pixels or data points in a 
region, and the scan speed to the counting program. Via 
a 6PIB bus 1104 the program (in an Iim PC conqpatible 
computer, for example) interfaces with a multichannel 
scaler 1106 such as a Stanford Research SR 430 and an x-y 
stage controller 1108 such as a Newport PH500. The 
signal from the light from the fluorescing substrate 
enters a photomultiplier 1110, providing output to the 
scaler 1106. Data are output from the scaler indicative 
of the number of counts in a given region. After 
scanning a selected area, the stage controller is 
activated with commands for acceleration and velocity, 
which in turn drives the scan stage 1112 such as a 
Newjport PH500-A to another region. 

Data are collected in an image data file 1114 
and processed in a scaling program 1116. A scaled image 
is output for display on, for example, a VGA display 
1118. The image is scaled based on an input of the 
percentage of pixels to clip and the minimum and maximum 
pixel levels to be viewed. The system outputs for use 
the min and max pixel levels in the raw data. 

B. p?^ta ABaXYpji.s 

The output from the data collection system is 
an array of data indicative of fluorescence intensity 
versus location on the substrate. The data are typically 
taken over regions substantially smaller than the area in 



wo 92/10092 



58 



PCr/US91/08693 



which synthesis of a given polymer has taken place. 
Merely by way of example,, if polymers were synthesized in 
sc[uares on the substrate having dimensions of 500 microns 
by 500 microns, the data may be taken over regions having 
dimensions of 5 microns by 5 microns. In most preferred 
embodiments, the regions over which florescence data are 
taken across tlie substrate are less than about 1/2 the 
area of the regions in which individual polymers are 
synthesized, preferably less than 1/10 the area in. which 
a single polymer is synthesized, and most preferably less 
than 1/100 the area in which a single polymer is 
synthesized. Hence ^ within any area in which a given 
polymer has been synthesized, a large number of 
fluorescence data points are collected. 

A plot of the number of pixels versus • 
fluorescence intensity for a scan of a cell when it has 
been exposed to, for exasple, a labeled antibody will 
typically take the form of a bell curve, but spurious 
data are observed, particularly at higher intensities. 
Since it is desirable to use an average of fluorescence 
intensity over a given synthesis region in determining 
relative binding affinity, these spurious data will tend 
to undesirably skew the data. 

Accordingly, in one embodiment of the invention 
the data are corrected for removal of these spurious data 
points, and an average of the data points is thereafter 
utilized in determining relative binding efficiency. 

Fig. 12 illustrates one embodiment of a system 
for removal of spurious data from a set of fluorescence 
data such as data used in affinity screening studies. A 
user or the system inputs data relating to the chip 
location and cell comers at step 1302. From this 
information and the image file, the system creates a 
computer representation of a histogram at step 1304, the 
histogram (at least in the form of a computer file) 
plotting number of data pixels versus intensity. 
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For each cell, a main data analysis loop is 
then performed. For each cell, at step 1306, the system 
calculates the total fluorescence intensity or number of 
pixels for the bandwidth centered around varying 
intmsity levels. Por example, as shown in the plot to 
the right of step 1306, the system calculates the number 
of pixels within the band of width w. The system then 
"moves" this bandwidth to a higher center intensity, and 
again calculates the number of pixels in the bandwidth. 
This process is repeated until the entire range of 
intensities have been scanned, and at step 1308 the 
system determines which band has the highest total number 
of pixels. The data within this bandwidth are used for 
further analysis. Assuming the bandwidth is selected to 
be reasonably small, this procedure will have the effect 
of eliminating spurious data located at the higher 
intensity levels. The system then repeats at step 1310 
if all cells have been evaluated, or repeats for the next 
cell. 

At step 1312 the system then integrates the 
data within the bandwidth for each of the selected cells, 
sorts the data at step' 1314 using the synthesis procedure 
file, and displays the data to a user on, for example, a 
video display or a printer. 

V. Pepraaento tl^ive Applications 

A. f>i1«Tft«»ieieottde Svnthesis 

generality of light directed spatially 
addressable parallel chemical synthesis is demonstrated 
by application to nucleic acid synthesis. 



1. B;iKa»^P3..e. 

idght activated formation of a thymidine- 
eytidine diner was carried out. A three dimensional 
representation of a fluorescence scan showing a 7 square 
by 4 square cbeckeiUoard pattern generated by the light- 
directed synthesis of a dinucleotide was produced. 
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5'-h£troVeratryl thymidine was attached to a synthesis 
substrate through the 3*. hydroxy 1 group. The 
nitroveratryl protecting groups were removed by 
illumination through a 500 im checkerboard mask. The 
substrate was then treated with phosphoramidite activated 
2'-^eo3qrc^idine. In order to follow the reaction 
fluorometrically, the deoxycytidine had been modified 
with ait FMOC protected aminohexyl linker attached to the 
expcyclic amine (5'-o-dim6thoxytrityl-4-H-(6-H- 
fluorenylmethylcarbamoyl-hexylcarboj^) -2 '-deoxycytidine) . 
After removal of the FMOC protecting group with base, the 
regions which contained the dinudeotide were 
fluorescently labelled by treatment of the substrate with 
1 ma FITC in DMF for one hour. 

The three-dimensional representation of the 
fluorescence intensity data showing alternating squares 
of bright raised pixels reproduces the checkerboard 
illumination pattern used during photolysis of the 
substrate. This result demonstrates that 
oligonucleotides as well as peptides can be synthesized 
by the light-directed method. 

In another example the light-activated 
formation of thymidine-cytidine-cytidine was carried out 
as shown in Fig. 13. Here, as in the previous example, 
5 •-nitroveratryl thymidine was attached to the substrate, 
via phosphoramidite chemistry to a surface containing 
[Bis {2-hydro3cyethyl)-3-aminopropylsiloxane}. The slide 
was then unlfonnly Illuminated (3«2nm at - 14mw/cai*) for 
10 minutes in the presence of dioxane. After drying, the 
surface was then treated with W,4-dimeth6xytrityl-5»- 
nitrov6ratryl-2 '-deoxycytidine-B • -o- (2-cyanoethyl) -N,N- 
diisopropylphosphoramidite in the presence of tetrazole 
(standard phosphoramidite coupling chemistry) . After 
oxidizing and drying, the plate was again illuminated as 
before except that a 500 ftm checkerboard mask was placed 
between the light source and the slide. The surface was 
then exposed to 5»-o-(4,4 »-Dimethoxy)-M-4-(6- 
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( (Biotinoyl) amino) hexanoyl) amino) hexanoyl , aminohexyl) -5* 
methyl-2 • -deoxycytidine-3 • -o- ( 2 -cyanoethy 1 ) -N , N- 
diisopropylphosphoramidite with tetrazale. After 
oxidizing and drying, the areas which contained the 
trinucleotide were fliiroescently labelled by treatment 
with FITC labled streptavidin. A resulting 
representation of the fluorescence intensity data showed 
alternating bright and dark squares corresponding to the 
500 im and checkerboard illumination pattern used dxiring 
photolysis. 

VI. conclusion 

The inventions herein provide a new approach 
for the simultaneous synthesis of a large niunber of 
compounds. The method can be applied whenever one has 
chemical building blocks that can be coupled in a solid* 
phase format, and when light can be used to generate a 
reactive group. 

The above description is illustrative and not 
restrictive. Many variations of the invention will 
become apparent to those of skill in the art upon review 
of this disclosure. Merely by way of example, while the 
invention is illustrated primarily with regard to peptide 
and nucleotide synthesis, the invention is not so 
limited. The scope of the invention should, therefore, 
be determined not with reference to the above 
description, but instead should be determined with 
reference to the appended claims along with their full 
scope of equivalents. 
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(2) INFOXaoaZON FOR SEQ ID N0:1: 

(i) SEQUmCE CHARACTERISTICS: 
<A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDHESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(aci) SEQUENCE DESCRIPTION: SEQ ID NOrl: 

Tyr Gly Gly Pha Leu 
1 5 
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(2) INFORH&TION FOR SEQ ID NO: 2: 

(i) SEQUENCE CBARACTERXSTZCS: 

(A) LEH6TB: 5 amino aoids 

(B) TYPE: amino acid 

(C) STRMIDEDNB8S: single 

(D) TOP0L06Y: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQT7ENCE DESCRIPTION: ID NO: 2: 



Pro 61y Gly Pbe Leu 
1 5 
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(2) IHFQRiaTION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(&) LEKGTR: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Tyr Gly Ala Gly Phe 
1 5 
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IHFOHHMflON FOR SEQ ID N0:4: 

(1) SSQUENCE CHARACTKRXSTXCS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SBQUEKCE DESCRIPTION: SEQ ID NO: 4: 

Tyr Gly Ala Phe Leu Ser 
1 5 
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(2) INFOiaaXIOK FOR SEQ NOiS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRftNDEDNESS: single 

(D) TOPOUXSZ: linear 

(11) HOLECDLE TYPE: peptide 



(Xl) SEQUENCE DESCRIPTION: SEQ ID HO: Si 

Tyr Gly Ala Phe Ser 
1 5 
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(2) INFOKmTION FOR SEQ ID NO: 6: 

(1) SEQUENCE cmkRACTERZSTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Tyr Gly Ala Phe Leu 
1 5 



wo 92/10092 PCrAJS91/08«O 

68 



(2) IMFOSHMMOB FOR SEQ ID N0:7: 

(i) SEQUENCE cmRACTERISTICS : 
(A) LEK6TH: 6 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECOLE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Tyr Gly Gly Phe Leu Ser 

1 5 
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(2) INPORMATIOM FOR SBQ ID NO: 8: 

(i) SEQUEHCE CHJOmCTERISTXCS: 

(A) LENGTH: 4 amino acids 

(B) TYPB: aaino aoid 

(C) STIANDEDMBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Tyr Gly Ala Phe 
1 
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INFOBHAXIOH FOR SBQ ID H0:9i 

(i) SEQUBWCE CBftRILCTERISTICS: 
(A) LENGTH; 5 amino acids 
(B> TYPE: amino acid 

(C) STRAHDEDNBSSi single 

(D) topology: linear 

(ii) HOLSCULE TYPE: p^tide 

(Xi) SEQUENCE DESCRIPTrON: SEQ ID 110:9 i 

Tyr Gly Ala Leu ser 
1 5 
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IHPOSHATION FOR SBQ ID HO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Tyr Gly Gly Phe Ser 
1 5 
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INFOBKATIOM FOR SEQ ID NO: lit 

(1) SEQUENCE CH2UEtACTEEtZSTZCS: 
(A> LENGTH: 4 aMno acids 

(B) TYPE: amino acid 

(C) STR&NDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Tyr Gly Ala Leu 
1 



IHFOWaTION FOR SBQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRAHDEDHESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tyr Gly Ala Phe Leu Phe 
1 5 
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(2) INFOKHATIOM FOR SEQ ZD NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENSER: 5 amino adds 

(B) TYPE: amina acid 

(C) STRANOEDHESS: single 

(D) TOFOLOGYt linear 

(il) HOLECOLE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13; 

Tyr 61y Ala Phe Phe 
1 5 
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IHFORKATION FOR SEQ ZD HO: 14: 

(i) SEQUENCE CHAKACTKRISTICS: 

(A) LENGTB; 5 amino acids 

(B) TYPE: amino acid 

(C) snANDBONESS: single 
(0) TOPOLOGY: linear 

(ii) MOI.EC0LE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTIOH: SEQ ID HO: 14: 

Tyr Gly 6ly Leu Ser 
I 5 
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(2) INFOBH&TIOK FOR SEQ ID HO: 15: 

(i) SEQUENCE CHKR&CTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TXPE: amino acid 

(C> STBAKDEDNESS: single 
(O) TOPOLOGY: linear 

(ii) HOLECDLE TZPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Tyr Gly 61y Phe Leu 
1 5 
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IHPORMATION FOR SEQ ID NO: 16: 

(i) SEQUEHCB CHARACTERISTICS: 

(A) LBHGTH: 6 amino adds 

(B) TYPE: amino acid 

(C) STRAHDEDHBSSt sin9l« 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

Tyr Gly Ala Phe Ser Phe 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHftSACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECDLB TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:17j 

Tyr Gly Ala Phe Leu Ser Phe 
1 5 
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(2) INFORHATIOM FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENCTH: 6 amino adds 

(B) TYPE: anino acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLE TYPE: peptide 



(xi) SEQUiaiCE DESCRIPTION: SEQ ID NO: 18: 

Tyr Gly Ala Phe Met Gin 
1 5 
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IHF0RM21TI0N FOR S£Q ID NO: 19; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: asdno aeid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Tyr Gly Ala Phe Met 
1 5 
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INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Tyr Gly Ala Phe Gin 
1 S 
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(2) INFOPMATIOH FOR SEQ ID N0:21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Tyr Gly Gly Phe Met 

1 .5 
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mkT IS CLMMED IS: 

1. A reactor system for synthesizing a plurality 
of polymer sequences on a substrate comprising: 

a) a reactor for contacting reaction fluids to 
said substrate; 

b) a system for delivering selected reaction 
fluids to said reactor; 

c) a translation stage for moving a mask or 
substrate from at least a first relative location 
relative to a second relative location; 

d) a light for Illuminating said substrate 
through a mask at selected times; and 

e) an appropriately programmed digital computer 
for selectively directing a flow of fluids from said 
reactor system, selectively activating said translation 
stage, and selectively illuminating said substrate so as 
to form a plurality of diverse polymer segfuences on said 
substrate at predetermined locations, 

2. The reactor system as recited in claim 1 
adapted to provide a plurality of monomers in a reaction 
fluid to said substrate, said substrate used for an 
initial screening of polymer sequences, 

3. An ordered method for forming a plurality of 
polymer sequences by sequential addition of reagents 
coBqE>rising the step of serially protecting and 
deprotecting portions of said plurality of polymer 
sequences for addition of other portions of said polymer 
sequences using a binary synthesis strategy. 

4. The method as recited in claim 3 wherein said 
binary synthesis strategy is a binary masking strategy, 

5. The method as recited in claim 4 wherein said 
masking strategy in which said masking strategy provides 
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at' least two consecutive steps in which a mask factors a 
previous mask by protecting a portion of a previously 
illuBiinated portions to light and exposing a portion of a 
previously protected portions to light. 

6. The method as recited in claim 4 in which said 
masking strategy in which at least two successive steps 
in said masking strategy illuminate about one half of a 
region of interest on said substrate. 

7. The method as recited in claim 4 wherein said 
masking strategy forms a plurality of polymer sequences 
on a single substrate. 

8. The method as recited in claim 4 wherein said 
masks are arranged in a gray code masking strategy, said 
gray code masking strategy having one edge illumination 
on each of a plurality of synthesis sites. 

. 9 • The method as recited in claim 4 wherein said 
masking strategy results in a minixnim number of masking 
steps for a number of polymers synthesized. 

10. The. method as recited in claim 4 wherein all 
possible polymers of length 1 are formed with a given 
basis set of monomers. 

11. The method as recited in claim 4 wherein said 
masking strategy is developed in an appropriately 
programmed digital cos^uter inputting at least a desired 
basis set, and length of polymers. 

12. The method as recited in claim 4 wherein all 
possible polymers of a length less than or ecpial to 1 are 
formed with a given basis set of monomers. 



wo 92/10092 



PCr/US91/08693 



85 

13. Tlie method as recitied in claim 4 further 
comprising the step of forming a portion of said polymers 
with a non-binary masking strategy • 

14. The method as recited in claim 10 further 
comprising the step of outputting a masking strategy. 

15. The method as recited in claim 10 further 
comprising the step of outputting a map of synthesized 
polymers on said substrate. 

16. The method as recited in claim 15 wherein said 
map is in the form of Fig. 9. 

17. A method of screening a plurality of linker 
polymers for use in binding affinity studies comprising 
the steps of: 

a) forming a plurality of linker polymers on a 
substrate in selected regions, said linker polymers 
formed by the steps of recursively: 

i) on a surface of a substrate, irradiating 
a portion of said selected regions to remove a 
protecting groi^; and 

ii) contacting said surface with a monomer; 

b) contacting said plurality of linker polymers 
with a ligand; and 

c) contacting said ligand with a labeled 

receptor. 

18. The method as recited in claim 17 wherein said 
ligand is a polypeptide. 

19. The method as recited in claim 17 wherein said 
receptor is an antibody. 

20. The method as recited in claim 17 wherein said 
monomers added in step ii) are the same in each of said 
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recursive- 8l:eps, said selected regions comprising linker 
molecules of different lengths. 

21- The method as recited in clgiim 17 wherein said 
labelled receptor is a fluoresceina^ed receptor. 

22. A system for determijiing affinity of a receptor 
to a ligand comprising: 

a) means for applying light: to a surface of a 
substrater said substrate coiq)rising a plurality of 
ligands at predetermined locations, said means for 
applying directing light providing simultaneous 
illumination at a plurality of said predetermined 
locations; and 

b) an array of detectors for detecting 
fluorescence at: said plurality of predetiermined 
locations. 

23. A system as recited in claim 22 wherein said 
means for applying light comprises a point light source 
and a cylindrical lens for focusing said point light 
source along a substantially linear palih. 

24. A system as recited in claim 22 wherein said 
array of detectors comprises a linear array. 

25. A system as recited in claim 22 wherein said 
array of detectors comprises a linear CCD array. 

26. In a digital computer, a method of determining 
the tendency of a receptor to bind to a ligand 
comprising: 

a) exposing fluorescently labelled receptors to 
a substrate, said substrate cotaprising a plurality . of 
ligcmds in regions at known locations; 
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b) at a plurality of data collection points 
within each of said regions^ determining an amount of 
fluorescence from said data collection points; 

c) removing said data collection points 
deviating from a preset amount from a predetermined 
statistical distribution; and 

d) determining a relative binding affinity of 
said receptor to remaining data collection points. 

27. The method as recited in claim 26 wherein said 
predetermined statistical distribution is a normal 
distribution* 

28. A compound having the formula: 




OMe 

wherein n = 0 or 1; Y is selected from the group 
consisting of an oseysen of the carboxyl group of a 
natural or unnatural amino acid, an amino group of a 
natural or unnatural amino acid, or the c~5' oxygen group 
of a natural or unnatural deoxyribonucleic or ribonucleic 
acid; and independently are a hydrogen atom, a lover 
alkyl, aryli benzyl, halogen, hydroxyl, alkoxyl, thiol, 
thioether, amino, nltro, carboxyl, formate, formamldo, 
sulfldo, or phosphldo group; and R^ is a alkoxy, alkyl, 
aryl, hydrogen, or alkenyl group. 

29. The compound of claim 28 wherein Y is the 
oxygen group of a natural or unnatural deoxyribonucleic 
or ribonucleic acid. 

30 • The compound of claim 29 wherein n = 0. 
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31. The coii«>ound of claift 29 wherein and are 
each a hydrogen atom. 

32. The con^iind of claim 31 wherein R^ is a 
hydrogen atom. 

33. The compound of claim 31 wherein is a methyl 
group. 

34. The conqpound of claim 28 wherein Y is an oxygen 
of the carboxyl group of an amino acid and n - 0. 

35. The compound of claim 34 wherein R^ and R^ are 
each a hydrogen atom* 

36. The compound of claim 35 wherein R^ is a 
hydrogen atom* 

37. The compound of claim 35 wherein R^ is a methyl 
group. 

38. A compound having the formula: 




wherein n = 0 or l? Y is selected from the group 
consisting of an amino group of a natural or tinnatural 
amino acid or the C-5' oxygen group of a natural or 
unnatural deoxyribonucleic and ribonucleic acid; R^, R*, 
and R^ independently are a hydrogen atom, a lower allcyl« 
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aryl, bensyl, halogen, hydroxyl, alkoscyl, thiol, 
thioether, amino, nitro,- carboxyl, formate, formamido, 
sulfido or phosphide group? R* and independently are a 
alkoxy, alley 1, hydrogen, halo, aryl, or alkenyl group- 

39. The compound of claim 38 wherein R^ through R^ 
are each a hydrogen atom. 

40. The compound of claim 39 wherein R^ and R^ are 
each a hydrogen atom. 

41. The compound of claim 39 wherein R^ and R^ are 
each a methyl group. 

42. A compound having the formula: 



wherein n«Oorl;Yisa c-5« oxygen group of a natural 
or unnatural deoxyribonucleic and ribonucleic acid? R^ 
through R^ independently are a hydrogen atom, a lower 
alkyl, aryl, benzyl, halogen, hydroxyl, alkoxyl, thiol, 
thioether, amino, nitro, carboxyl, formate, formamido, 
sulfido, or phosphide group; and r' is a alkoxy, alkyl, 
aryl, or alkenyl group. 

43. The compound of claim 42 wherein R^ and R^ are 
each a methoxy group. 

44. The compound of claim 43 wherein R^ and R^ are 
each a hydrogen atom. 




O r5 



NO2 
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45* The compoiind of claim 44 wherein is a methyl 
group* 

46* A compound having 1:he formula: 




wherein n = O or 1? Y is an atom to be protected; and 

independently are a hydrogen atom, a lower alkyl, aryl, 
benzyl, halogen, hydroxyl, alkoxyl, thiol, thioether, 
amino, nitro, carboxyl, formate, formamidOr sulfido, or 
phosphido groupr and R^ is a alkoxy, alkyl, aryl, or 
alkenyl group. 

47. The compound of claim 46 wherein Y is selected 
from the group consisting of an oxygen of the carboxyl 
group of a natural or unnatural amino acid, or the c-*5' 
oxygen group of a natural or unnatural deoxyribonucleic 
or ribonucleic acid, or the amino group of a natural or 
unnatural amino acid« 

48. The compound of claim 47 wherein R^ and R^ are 
hydrogen. 

49. The compound of claim 48 wherein R^ is a methyl 
group. • 

50. A compound having the formula: 



O 
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where R is a side chain of a natural or unnatural amino 
acid and X is a photoremovable protecting group. 

51. The compound of claim 50 wherein X has the 
following formula: 



where R^, R^r R^r and R* independently are a hydrogen atom, 
a lower alkyl, aryl, benzyl, halogen, hydroxyl, alkoxyl, 
thiol, thioether, amino, nitro, carboxyl, formate, 
formamido or phosphide group, or adjacent substituents 
are substituted oxygen groups that together form a cyclic 
acetal or ketal; and R^ is a hydrogen atom, a alkoxyl, 
alkyl, halo, aryl, or alkenyl group. 

52* The compound of claim 51 herein R^ and R^ are 
each a hydrogen atom, and R^ and R^ are each a methoxy 
group. 

52. The compound of claim 52 wherein R^ is a methyl 
group » 

54. The compound of claim 51 wherein R^ and R^ are 
substituted oxygen groups that together form a cyclic 
acetal. 

55. The compound of claim 54 wherein R^ and R"* are 
each a hydrogen atom. 
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56. The coaqpound of claim 55 wherein is a methyl 
group.- 
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issued 21 October 1970, A. Patchomik et al., 
**Photosehsitive Protecting Groups", pages 6333-6335. 
See page 6334 « Scheme I. 
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V. □ OBSERVATIONS WHERE CERTAIN CLAIMS WERE POUND UNSEARCHABLE^ 



TNstm«n«lonalB««diraporthatnat bMnettoMishedinrvspaaof c«m^ 170) Ul forihB foOowmg m 

1 . Q CUkn nunrim ^ b«c«iM th«y ralats to subject mattor <1) net /equirad to bo tawehod by tWt Authority, ntmaly: 



2. □ Qitni nun*M ^ bacauto thoy ralato to p«ta of tho bttomitkmil app 
prateifbad lOcMramarM to auch an axtart that no moanirvM Im^^ 



3.D Oatmnurtei^ bocaunthtyaradapandamdalmanotdrafcadln 

of PCT fl^. 



accofdinco with tha tioond and tMnl santincaa 



VLD OeSERVATIOWSVWERg unity of BiVBmOWiSLACiaNO^ 



TMa Intamational SaarcNng Authomy found rnuMplo invamiont In tNs intamational app8estton aa fotows: 



!«□ Aa^to^dwdaddWcyiy aichtaMw afatln^ 
diiwa Of tfio hMantadonil appl ic i ti on» 

^ D At only soma of tho laqulrod adifiik)nal M«eh faaa ware timaly pa^ 

only thoia daima of tha imomational application for >MNch faaa ware pM^ 



3. n M» 'Wtrad tddftiony lovth foaa 
raainctoa to tna oivonDon nrat 



waro timaly paid by tha appltcam. 

' in tha dainta; it it covatad by dabn 



tNa tntamsdonal taarah lapoit ia 



4. PI Aa aa aaarchabia dafma could ba aaarchad without affoit lustifyinQ an 
not InMta payment of any additional foe. 

Ramafk'on pfotatt 

Q The additional taarch feat ware accompanied by appUeant't pfotaat< 

Q No protait accompanied tha payment of additional taarch feat. 



additional fee. the bitemadonal Seaich Auttwiity did 



Form I>CT;ISA/210 (aupplemantai •heetC2)URev. 4-90) S 



InMmatienal ApplieatiDn r i»eTAI891/086S3 



niRTHER INFORMATION CONTfNUED FROM PRtSVIOUS SNGEfS 



Z. CLASSZPlCaXZbH OP StJBJBCr HAZTSRs 
IPC (5) : 

AOIH 1/02; CI20 1/00; QOIM 33/S6S. 33/543; BOXJ 15/00; C07D 471/02, 235/00, 473/00, 
235/30; C07K 1/04, 17/06. 17/14 

X. CLASSIFICATION OP SUBJECT MATTER: 
US CL 

435/7.92, 7,94, 7.95, 961, 968, 973. 307; 536/26; 562/441; 436/318, 527. 807; 
525/54.1, 54.11; 422/116, 131; 530/333, 334, 335, 336, 337; 935/88 



Foim PCr/tSA/210 feontfnuaiton shaot CIKOet 1991» B. 



•ft- DESIGNATIONS OF 



Any designation of has effect in the Russian Federation. II is not yet known whether any ^ch 
designation has effect in other States of the farmer Soviet Union. 



FOB THE PURPOSBS OF INFORMATION ONLY 

Cddes used to ideoOfy Slates par^ lo the PCT on the front pages of jiamphlels publishing Intemationai 
applications under the PCT 



AT 


•Ausula 


BS 




MG 


Modagwar 


AO 


Australia 


PI 


Fbdaml 


ML 


Mali 


BB 


Basfiadas 


FR 


France 


MN 


Monpiiia 


BE 


Burkina Faso 


GA 


Gabon 


MB 


Mauritania 


BF 


OB 


Uoiied Kingdom 


MW 


Malawi 


BG 


BuJgaria 
Benin 


CN 


OiiItibii 


NL 


NBincraaiis 
Norm; 


Bi 


OR 


GfCGce 


NO 


ea 


Bnoil 


HU 




PL 


Poland 


CA 




IT 


Italr 


BO 


Romania 


CF 


OcnuaT African RcpahBe 


JP 


Jaiaa 


SO 


Sudan 


CO 


Congo 


KP 


Dcmcccalfc hoptelK KcFubOc 


SE 


Sweden 


CH 


Switzerland 




oTKocca 


6M 


Senegal 


ca 


CStcdlvoin: 


KB 


RcpiiMic of KoiGft 


su* 


Soviet Union 


CM 


Cameroon 


U 


siiusr** 


TO 


Chad 


cs 


Qoxfaasfovakla 


LK 




TG 


Togp 


OB 




LU 


LnxpnlNnifg 


IIS 


Unlicd Stains of America 


OK 


Deonark 


MC 


Monaco 
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